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This invention relates to polynu- 
clelotides encoding Glycoprotein B from 
the RFHV/KSHV subfamily of gamma 
herpes viruses, three members of which 
are characterized in detail. DNA extracts 
were obtained from Macaque nemestrina 
and Macaque mulatto monkeys affected 
with retroperitoneal fibromatosis (RF), and 
human AIDS patients affected with Kaposi's 
sarcoma (KS). The extracts were amplified 
using consensus-degenerate oligonucleotide 
probes designed from known . protein and 
DNA sequences of gamma herpes viruses. 
The nucleotide sequences of a 319 base pair 
fragment are about 76 % identical between 
RFHV1 and KSHV, and about 60-63 % 
identical with the closest related gamma 
herpes viruses outside the RFHV/KSHV 
subfamily. Protein sequences encoded within 
these fragments are about 91 % identical 
between RFHV1 and KSHV, and < ^ 65 
% identical to that of other gamma herpes 
viruses. The full-length KSHV Glycoprotein 
B sequence comprises a transmembrane 
domain near the N-terminus, and a 
plurality of potentially antigenic sites in the 
extracellular domain. Materials and methods 
are provided to characterize Glycoprotein 
B encoding regions of members of the 
RFHV/KSHV subfamily, including but not 

limited to RFHV1, RFHV2, and KSHV Peptides, polynucleotides, and antibodies of this invention can be used for diagnosing infection, 
and for eliciting an immune response against Glycoprotein B. 
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Glycqprotfim R of the RFHV/KSHV Subfamily of Herpes Viruses 



Technical Field 

5 The present invention relates generally to the field of virology, particularly viruses of the herpes 

family. More specifically, it relates to the identification and characterization of herpes virus Glycoprotein B 
molecules which are associated with fibroproliferatrve and neoplastic conditions in primates, including 
humans. 

10 Background Art 

Kaposi's Sarcoma is a disfiguring and potentially fatal form of hemorrhagic sarcoma. It is 
characterized by multiple vascular tumors that appear on the skin as darkly colored plaques or nodules. At 
the histological level, it is characterized by proliferation of relatively uniform spindle-shaped cells, forming 

15 fascicles and vascular slits. There is often evidence of plasma cells, T cells and monocytes in the 
inflammatory infiltrate. Death may ultimately ensue due to bleeding from gastrointestinal lesions or from an 
associated lymphoma. (See generally Martin et al., Finesmith et at.) 

Once a relatively obscure disease, it has leapt to public attention due to its association with AIDS. 
As many as 20% of certain AIDS-affected populations acquire Kaposi's during the course of the disease. 

20 Kaposi's Sarcoma occurs in other conditions associated with immunodeficiency, including kidney dialysis 
and therapeutic immunosuppression. However, the epidemiology of the disease has suggested that 
immunodeficiency is not the only causative factor. In particular, the high degree of association of Kaposi's 
with certain sexual practices suggests the involvement of an etiologic agent which is not the human 
immunodeficiency virus (Berel et al.). 

25 A herpes-viras-like DNA sequence has been identified in tissue samples from Kaposi's lesions 

obtained from AIDS patients (Chang et al., confirmed by Ambroziuk et al.). The sequence was obtained by 
representational difference analysis (Usitsyn et al.). in which DNA from affected and unaffected tissue 
were amplified using unrelated priming oligonucleotides, and then hybridized together to highlight 
differences between the cells. The sequence was partly identical to known sequences of the Epstein Barr 

30 Virus and herpesvirus saimiri. It coded for capstd and tegument proteins, two structural components 
sequestered in the viral interior. In a survey of tissues from various sources, the sequence was found in 
95% of Kaposi's sarcoma lesions, regardless of the patients* HIV status (Moore et al. 1995a). 21% of 
uninvolved tissue from the same patients was positive, while 5% of samples from a control population was 
positive. There was approximately 0.5% sequence variation between samples. 

35 The same sequence has been detected in body cavity lymphoma, a lymphomatous effusion with 

B-cell features, occurring uniquely in AIDS patients (Cesarman et al.). The copy number was higher in 
body cavity lymphoma, compared with Kaposi's Sarcoma. Other AIDS-assodated lymphomas were 
negative. The sequence has also been found in peripheral blood mononuclear cells of patients with 
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Castleman's disease (Dupin el al.). This is a condition characterized by morphologic features of 
angiofolicular hyperplasia, and associated with fever, adenopathy, and splenomegaly. The putative virus 
from which the sequence is derived has become known as Kaposi's Sarcoma associated Herpes Virus 
(KSHV). 

Using PCR in situ hybridization. BoshofT et al. have detected KSHV polynucleotide sequences in 
the cell types thought to represent neoplastic cells in Kaposi's sarcoma. Serological evidence supports an 
important role for KSHV in the etiology of Kaposi's sarcoma (O'Leary). Kedes et al. developed an 
immunofluorescence serological assay that detects antibody to a latency-associated nuclear antigen in B 
cells latently infected with KSHV, and found that KSHV seropositive j s high in patients with Kaposi's 
sarcoma. Gao et al. found that of 40 patients with Kaposi's sarcoma. 32 were positive for antibodies 
against KSHV antigens by an immunoblot assay, as compared with only 7 of 40 homosexual men without 
Kaposi's sarcoma immediately before the onset of AIDS. Miner et al. prepared KSHV antigens from a 
body cavity lymphoma cell line containing the genomes of both KSHV and Epstein-Ban- virus. Antibodies 
to one antigen, designated p40, were Identified in 32 of 48 HIV-1 infected patients with Kaposi's sarcoma, 
as compared with only 7 of 54 HIV-1 infected patients without Kaposi's sarcoma. 

Zhong et al. analyzed the expression of KSHV sequences in affected tissue at the messenger RNA 
level. Two small transcripts were found that represent the bulk of the virus specific RNA transcribed from 
the KSHV genome. One transcript was predicted to encode a small membrane protein; the other is an 
unusual poty-A RNA that accumulates in the nucleus and may have no protein encoding sequence. 
Messenger RNA was analyzed by cloning a plurality of overlapping KSHV genomic fragments that 
spanned the -120 kb KSHV genome from a lambda library of genomic DMA. The clones were used as 
probes for Northern analysis, but then* sequences were not obtained or disclosed. 

Moore et al. have partially characterized a KSHV genome fragment obtained from a body-cavity 
lymphoma. A 20.7 kb region of the genome was reportedly sequenced, although the sequence was not 
disclosed. 17 partial or complete open reading frames were present in this fragment, all except one having 
sequence and positional homology to other known gamma herpes virus genes, including the capsid 
maturation gene and the thymidine kinase gene. Phylogenetic analysis showed that KSHV was more 
closely related to equine herpes virus 2 and Saimiri virus than to Epstein Barr virus. The 20.7 kb region did 
not contain sequences encoding either Glycoprotein B or DNA polymerase. 

The herpes virus family as a whole comprises a number of multi-enveloped viruses about 100 ran in 
size, and capable of infecting vertebrates. (For general reviews, see. e.g.. Emery et al.. Fields et al ). The 
double-stranded DNA genome is unusually large - from about 88 to about 229 kilobases in length. It may 
produce over SO different transcripts at various stages in the life cycle of the virus. A number of 
glycoproteins are expressed at the viral surface, and play a role in recognition of a target cell by the virus, 
and penetration of the virus into the cell. These surface proteins are relatively more variant between 
species, compared with internal viral components (Karlin et al.). The same surface proteins are also 
present on defective viral particles produced by cells harboring the virus. One such non-infectious form is 
the L-parBde, which comprises a tegument and a viral envelope, but lacks the nudeocapsid. 
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The herpes virus family has been divided into several subfamilies. Assignments to each of the 
categories were originally based on biologic properties, and are being refined as genomic sequence data 
emerges. The alpha subfamily comprises viruses that have a broad host range, a short implicative cycle, 
and an affinity for the sensory ganglia. They include the human simplex virus and the Varicella-zoster 
5 virus. The beta subfamily comprises viruses that have a restricted host range, and include 
Cytomegalovirus and human Herpes Virus 6. The gamma subfamily comprises viruses that are generally 
lymphotrophic. The DNA is marked by a segment of about 110 kilobases with a low GC content flanked 
by multiple tandem repeats of high GC content. The gamma subfamily includes Epstein Ban* Virus (EBV), 
herpes virus saimiri, equine Herpes Virus 2 and 5, and bovine Herpes Virus 4. 

10 Herpes viruses are associated with conditions that have a complex clinical course. A feature of 

many herpes viruses is the ability to go into a latent state within the host for an extended period of time. 
Viruses of the alpha subfamily maintain latent forms in the sensory and autonomic ganglia, whereas those 
of the gamma subfamily maintain latent forms, for example, in cells of the lymphocyte lineage. Latency is 
associated with the transcription of certain viral genes, and may persist for decades until conditions are 

15 optimal for the virus to resume active replication. Such conditions may include an immunodeficiency. In 
addition, some herpes viruses of the gamma subfamily have the ability to genetically transform the cells 
they infect For example, EBV is associated with B cell lymphomas, oral hairy leukoplakia, lymphoid 
interstitial pneumonitis, and nasopharyngeal carcinoma. 

A number of other conditions occur in humans and other vertebrates that involve fibroprotiferation 

20 and the generation of pre-neoplastic cells. Examples occurring in humans are retroperitoneal fibrosis, 
nodular fibromatosis, pseudosarcomatous fibromatosis, and sclerosing mesenteries. Another condition 
known as Enzootic Retroperitoneal Fibromatosis (RF) has been observed in a colony of macaque 
monkeys at the University of Washington Regional Primate Research Center (Giddens et al.). Late stages 
of the disease are characterized by proliferating fibrous tissue around the mesentery and the dorsal part of 

25 the peritoneal cavity, with extension into the inguinal canal, through the diaphragm, and into the abdominal 
wall. Once clinically apparent, the disease is invariably fatal within 1-2 months. The condition has been 
associated with simian immunodeficiency (SAIDS) due to a type D simian retrovirus, SRV-2 (Tsai et al ). 
However, other colonies do not show the same frequency of RF amongst monkeys affected with SAIDS, 
and the frequency of RF at Washington has been declining in recent years. 

30 The study of such conditions in non-human primates is important not only as a model for human 

conditions, but also because one primate species may act as a reservoir of viruses that affect another 
species. For example, the herpes vims saimiri appears to cause no disease in its natural host, the squirrel 
monkey (Saimiri sdureus), but it causes polyclonal T-cell lymphomas and acute leukemias in other 
primates, particularly owl monkeys. 

35 There is a need to develop reagents and methpdS~fD£ s use in toe^eteSorTand treatment of herpes 

virus infections. The etiological linkage between KSHV ind Kaposi's sarcoma, confirmed by the 
serological evidence, indicates the importance of this need. 

For example, there is a need to develop reagents and methods which can be used in the diagnosis 
and assessment of Kaposi's sarcoma, and similar conditions. Being able to detect the etiologic agent in a 
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new patient may assist in differential diagnosis; being able to assess the level of the agent in an ongoing 
condition may assist in clinical management Desirable markers include those that provide a very 
sensitive indication of the presence of both active and latent forms viral infection, analogous to the HBsAg 
of Hepatitis B. Desirable markers also include those that are immunogenic, and can be used to assess 
immunological exposure to the viral agent as manifest in the antibody response. Glycoprotein antigens 
from the viral envelope are particularly suitable as markers with these characteristics. They may be 
expressed at high abundance near the surface not only of replicative forms of the virus, but also on L- 
particles produced by vitally infected cells. 

Second, there is a need to develop reagents and methods that can be used for treatment of viral 
infection - both prophylactically. and following a viral challenge. Such reagents include vaccines that 
confer a level of immunity against the virus. Passive vaccines, such as those comprising an anti-virus 
antibody, may be used to provide immediate protection or prevent cell penetration and replication of the 
virus in a recently exposed individual. Active vaccines, such as those comprising an immunogenic viral 
component, may be used to elicit an active and ongoing immune response in an individual. Antibody 
elicited by an active vaccine may help protect an individual against a subsequent challenge by live virus. 
Cytotoxic T cells elicited by an active vaccine may help eradicate a concurrent infection by eliminating host 
cells involved in viral replication. Suitable targets for a protective immune response, particularly antibody, 
are protein antigens exposed on the surface of viral particles, and those implicated in fusion of the virus 
with target cells. 

Third, there is a need to develop reagents and methods which can be used in the development of 
new pharmaceuticals for Kaposi's sarcoma, and similar conditions. The current treatment for Kaposi's is 
radiation in combination with traditional chemotherapy, such as vincristine (Northfert. Mitsuyasu). While 
lesions respond to these modalities, the response is temporary, and the downward clinical course 
generally resumes. Even experimental therapies, such as treatment with cytokines, are directed at the 
25 symptoms of the disease rather than the cause. Drug screening and rational drug design based upon the 
etotogic agent can be directed towards the long-felt need for a clinical regimen with long-term efficacy. 
Suitable targets for such pharmaceuticals are viral components involved in recognition and penetration of 
host cells. These include glycoprotein components of the viral envelope. 

Fourth, there is a need to develop reagents and methods which can be used to identify new viral 
agents that may be associated with other fibroprdiferative conditions. The representational difference 
analysis technique used by Chang et aJ. is arduously complex, and probably not appropriate as a general 
screening test. More desirable are a set of oligonucleotide probes, peptides, and antibodies to be used as 
reagents in more routine assays for surveying a variety of tissue samples suspected of containing a related 
etiologic agent. The reagents should be sufficiently specific to avoid identifying unrelated viruses and 
endogenous components of the host, and may be sufficiently cross-reactive to identify related but 
previously undescribed viral pathogens. 
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Disclosure of the Invention 

It is an objective of this invention to provide isolated polynucleotides, polypeptides, and antibodies 
derived from or reactive with the products of novel genes encoding Glycoprotein B molecules of the 
5 RFHV/KSHV subfamily of herpes viruses. Two members of the family are Retroperitoneal Fibromatosis 
associated Herpes Virus (RFHV) and Kaposi's Sarcoma associated Herpes Vims (KSHV). These 
materials and related methods can be used in the diagnosis and treatment of herpes virus infection in 
primates, including humans. Isolated or recombinant Glycoprotein B fragments or polynucleotides 
encoding them may be used as components of an active herpes vaccine, while antibodies specific for 

1 0 Glycoprotein B may be used as components of a passive vaccine. 

Accordingly, one of the embodiments of the invention is an isolated polynucleotide with a region 
encoding a Glycoprotein B of a herpes virus of the RFHV/KSHV subfamily, the polynucleotide comprising 
a sequence of 319 nucleotides at least 65% identical to nucleotides 36 to 354 of SEQ. ID NO:1 or SEQ. ID 
NO:3, which are 319 nucleotide fragments encoding Glycoprotein B from RFHV and KSHV, respectively. 

15 Also embodied is an isolated polynucleotide with a region encoding a Glycoprotein B, the polynucleotide 
comprising a sequence selected from the group consisting of a sequence of 35 nucleotides at least 74% 
identical to oligonucleotide SHMDA (SEQ. ID NO:41); a sequence of 30 nucleotides at least 73% identical 
to oligonucleotide CFSSB (SEQ. ID NO:43); a sequence of 29 nucleotides at least 72% identical to 
oligonucleotide ENTFA (SEQ. ID NO:45); and a sequence of 35 nucleotides at least 80% identical to 

20 oligonucleotide DNIQB (SEQ. ID NO:46). 

Another embodiment of the invention is an isolated polynucleotide comprising a fragment of at least 
21, preferably 35, more preferably 50, still more preferably 75, and even more preferably 100 consecutive 
nucleotides of the Glycoprotein B encoding region of the polynucleotide of the preceding embodiments. 
The polynucleotide is preferably from a virus capable of infecting primates. Included are Glycoprotein B 

25 encoding polynucleotide fragments from RFHV and KSHV. Another embodiment of the invention is an 
isolated polynucleotide comprising a linear sequence of at least about 21 nucleotides identical to a the 
Glycoprotein B encoding sequence between nucleotides 36 to 354 inclusive of SEQ. ID NO:1, SEQ. ID 
NO:3, or SEQ. ID NO:92, or anywhere within SEQ. ID NO:96, but not in SEQ. ID NO:98. 

A further embodiment of this invention is an isolated polypeptide encoded by any of the previous 

30 embodiments. Also embodied is an isolated polypeptide, comprising a linear sequence of at least 17 
amino acids essentially identical to the Glycoprotein B protein sequence shown in SEQ. ID NO:2, SEQ. ID 
NO:4, or SEQ. ID NO:97. or anywhere within SEQ. ID NO:94 (KSHV), but not in SEQ. ID NO:99. This 
includes fusion polypeptides, immunogenic polypeptides, and polypeptides occurring in glycosylated and 
unglycosylated form. Some preferred antigen peptides are listed in SEQ. ID NOS:67-76. Also embodied 

35 are isolated and non-naturally occurring polynucleotides encoding any of the aforementioned polypeptides, 
along with cloning vectors, expression vectors and transfected host cells derived therefrom. Further 
embodiments are method for producing polynucleotides or polypeptides of this invention, comprising 
replicating vectors of the invention or expressing polynucleotides in suitable host cells. 
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Yet another embodiment of this invention is a monoclonal or isolated polyclonal antibody specific for 
a Glycoprotein B polypeptide embodied in this invention, or a Glycoprotein B encoded in the encoding 
region of a polynucleotide embodied in this invention. The antibodies are specific for members of the 
RFHV/KSHV subfamily, and do not cross-react with more distantly related Glycoprotein B sequences, 
5 particularly SEQ. IDNOS:3(M1. 

other Glycoprotein Bantibodies are specfficA monoclonal or isolated polyclonal antibody specific for the 
polypeptide of claim 9, but not for a polypeptide having an amino acid sequence of any of SEQ. ID 
NOS:3(M! 

Still another embodiment of this invention is a vaccine comprising a polypeptide of this invention in 

1 0 a pharmaceutical^ compatible excipient, and optionally also comprising an adjuvant Another embodiment 
of this invention is a vaccine comprising a polynucleotide of this invention, which may be in the form of a 
live virus or viral expression vector. Another embodiment of this invention is a vaccine comprising an 
antibody of this invention in a pharmaceutical^ compatible excipient. Other embodiments are methods for 
treating a herpes virus infection, either prophytactically or during an ongoing infection, comprising 

1 5 administering one of the aforementioned embodiments. 

Further embodiments of this invention are oligonucleotides specific for Glycoprotein B encoding 
sequences of the gamma herpes subfamily, the RFHV/KSHV subfamily, RFHV, and KSHV, especially 
those listed in SEQ. ID NOS:24-63. Also embodied are methods for obtaining an amplified copy of a 
polynucleotide encoding a Glycoprotein B, comprising contacting the polynucleotide with one or more of 

20 the aforementioned oligonucleotides. The polynucleotide to be amplified may be taken from an individual 
affected with a disease featuring fibroblast proliferation and collagen deposition, including but not limited to 
Retroperitoneal Fibromatosis or Kaposi's Sarcoma, or a malignancy of the lymphocyte lineage. 

Additional embodiments of this invention are methods for detecting viral DNA or RNA in a sample. 
One method comprises the steps of contacting the DNA or RNA in the sample with a probe comprising a 

25 polynucleotide or oligonucleotide of this invention under conditions that would permit the probe to form a 
stable duplex with a polynucleotide having the sequence shown in SEQ. ID NO:1 or SEQ. ID NO:3, or 
both, but not with a polynucleotide having a sequence of herpes viruses outside the RFHV/KSHV 
subfamily, particularly SEQ. ID NOS:5-13, and detecting the presence of any duplex formed thereby. The 
conditions referred to are a single set of reaction parameters, such as incubation time, temperature, solute 

30 concentrations, and washing steps, that would permit the polynucleotide to form a stable duplex if 
alternatively contacted with a polynucleotide with SEQ. ID NO:1, or with a polynucleotide with SEQ. ID 
NO:3, or with both, but not with a polynucleotide of any of SEQ ID NO:5-13. Another method comprises 
the steps of amplifying the DNA or RNA in the sample using an oligonucleotide of this invention as a 
primer in the amplification reaction, and detecting the presence of any amplified copies. Also embodied 

35 are isolated polynucleotides identified by the aforementioned methods, as may be present in the genome 
of a naturally occurring vims or affected tissue. 

Further embodiments of this invention are diagnostic kits for detecting components related to 
herpes vims infection in a biological sample, such as may be obtained from an individual suspected of 
harboring such an infection, comprising a polynucleotide, oligonucleotide, polypeptide, or antibody of this 
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invention in suitable packaging. Also embodied are methods of detecting infection of an individual, 
comprising applying the reagents, methods, or kits of this invention on biological samples obtained from 
the individual. 

Still other embodiments of this invention are therapeutic compounds and compositions for use in 
5 treatment of an individual for infection by a gamma herpes virus. Included are therapeutic agents that 
comprise polynucleotides and vectors of this invention for the purpose of gene therapy. Also included are 
pharmaceutical compounds identified by contacting a polypeptide embodied in this invention with the 
compound and determining whether a biochemical function of the polypeptide is altered. Also included are 
pharmaceutical compounds obtained from rational drug design, based on structural and biochemical 
1 0 features of a Glycoprotein B molecule. 

Brief Description of the Dbawiw^ 



Figure 1 is a listing of polynucleotide sequences amplified from a Glycoprotein B encoding region of 

15 RFHV and KSHV. The 319-base polynucleotide segment between residues 36 to 354 is underlined, and 
represents the respective viral gene segment between the primers used to amplify it. Aligned with the 
polynucleotide sequences are oligonucleotides that may be used as hybridization probes or PCR primers. 
Type 1 oligonucleotides comprise a gamma herpes consensus sequence, and can be used to amplify a 
Glycoprotein B gene segment of a gamma herpes virus. Examples shown are NtVPA and TVNCB. Type 

20 2 oligonucleotides comprise a consensus sequence from the RFHV/KSHV subfamily, and can be used to 
amplify Glycoprotein B gene segment of a virus belonging to the subfamily. Examples shown are SHMDA, 
CFSSB, ENTFA and DNIQB. The other oligonucleotides shown are Type 3 oligonucleotides. These 
comprise sequences taken directly from the RFHV or KSHV sequence, and are specific for sequences 
from the respective vims. Oligonucleotides that initiate amplification in the direction of the coding 

25 sequence (with designations ending in "A") are listed 5'->3\ Oligonucleotides that initiate amplification in 
the direction opposite to that of the coding sequence (with designations ending in "B") are listed 3'->5'. 
Also shown are the polypeptides encoded by the RFHV and KSHV polynucleotide sequences. The 
asparagirie encoded by nucleotides 238-240 in both sequences is a potential N-linked glycosyiation site 
conserved with other herpes viruses. 

30 Figure 2 is a map of the Glycoprotein B encoding DNA sequence believed to be contained in the 

KSHV genome, and other members of the RFHV/KSHV subfamily. Shown are the approximate location of 
the KSHV Glycoprotein B sequence described herein. Also shown are the putative conserved segments 
that represent hybridization sites for Type 1 consensus/degenerate oligonucleotides useful in probing and 
amplifying Glycoprotein B sequences from gamma herpes viruses. 

35 Figure 3 is a listing of some previously known herpes virus Glycoprotein B protein sequences, 

aligned with the complete KSHV Glycoprotein B protein sequence and fragments of RFHV1 and RFHV2. 
Boxed regions indicate the putative pre-processing signal sequence and the transmembrane domain. 
Cysteine residues are underlined. Residues that are highly conserved amongst herpes virus Glycoprotein 
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B sequences are underscored with an asterisk (*) Cysteines appearing uniquely in the KSHV 
Glycoprotein B are underscored with a bullet (•). 

Figure 4 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma herpes 
viruses, showing a conserved region, and the Type 1 oligonucleotide FRFDA designed therefrom. 

Figure 5 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma herpes 
viruses, showing a conserved region, and the Type 1 oligonucleotides NIVPA and NIVPASQ designed 
therefrom. 

Figure 6 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma herpes 
viruses, showing a conserved region, and the Type 1 oligonucleotides TVNCA, TVNCB and TVNCBSQ 
designed therefrom. 

Figure 7 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma herpes 
viruses, showing a conserved region, and the Type 1 oligonucleotide FAYDA designed therefrom. 

Figure 8 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma herpes 
viruses, showing a conserved region, and the Type 1 oligonucleotides IYGKA and IYGKASQ designed 
therefrom. 

Figure 9 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma herpes 
viruses, showing a conserved region, and the Type 1 oligonucleotides CYSRA and CYSRASQ designed 
therefrom. 

Figure 10 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma 
herpes viruses, showing a conserved region, and the Type 1 oligonucleotides NIDFB and NIDFBSQ 
designed therefrom. 

Figure 11 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma 
herpes viruses, showing a conserved region, and the Type 1 oligonucleotides FREYA, FREYB and 
NVFDA designed therefrom. 

Figure 12 is a listing of previously known Glycoprotein B polynucleotide sequences of gamma 
herpes viruses, showing a conserved region, and the Type 1 oligonucleotide GGMA designed therefrom. 

Figure 13 is a listing of a portion of the Glycoprotein B polynucleotide sequence from RFHV and 
KSHV, aligned with previously known gamma herpes Glycoprotein B polynucleotide sequences. Each 
shared residue is indicated as a period. 

Figure 14 is a comparison listing of the polypeptide sequences of Glycoprotein B from various 
gamma herpes viruses, encoded between the hybridization sites of NIVPA and TVNCB in the 
polynucleotide sequences. The Class II sequence fragments shown underlined are predicted to be 
RFHV/KSHV cross-reactive antigen peptides. The Class III sequences shown in lower case are predicted 
to be RFHV or KSHV virus-specific peptides. 

Figure 15 is an alignment of the polypeptide sequences of Glycoprotein B over a broader spectrum 
of herpes viruses in the gamma, beta, and alpha subfamilies. 

Figure 16 is a relationship map of Glycoprotein B, based on the polypeptide sequences shown in 
Figure 15. 
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Figure 17 is a listing of exemplary Type 2 (subfamily-spedfic) oligonucleotides, aligned with the 
nucleotide sequences from which they were derived. 

Figure 18 is an approximate map of Glycoprotein B and DNA polymerase encoding regions as they 
appear in the KSHV genome, showing the hybridization position of oligonucleotide primers. 
5 Figure 19 is a listing of a KSHV DNA sequence obtained by amplifying fragments upstream and 

downstream from the sequence in Figure 1. An open reading frame is shown for the complete KSHV 
Glycoprotein B sequence, flanked by open reading frames for the capsid maturation gene and DNA 
polymerase. Underlined in the nucleotide sequence is a putative Glycoprotein B promoter 

Figure 20 is a Hopp-Woods antigenicity plot for the 106 nucleotide Glycoprotein B polypeptide 
1 0 fragment of RFHV encoded between NIVPA and TVNCB. Indicated below are spans of hydrophobic and 
antigenic residues in the sequence. 

Figure 21 is a Hopp-Woods antigenicity plot for the 106 nucleotide Glycoprotein B polypeptide 
fragment of KSHV encoded between NIVPA and TVNCB. 

Figure 22 is a Hopp-Woods antigenicity plot for the complete Glycoprotein B from KSHV. 
15 Figure 23 is a listing of DNA and protein sequences for a Glycoprotein B fragment of a third 

member of the RFHV/KSHV subfamily, designated RFHV2. The 319-base polynucleotide segment 
between residues 36 to 354 is underlined, and represents the Glycoprotein B encoding segment between 
the primers used to amplify it 

20 Best Mode for Carrying Out the Invention 

We have discovered and characterized polynucleotides encoding Glycoprotein B from herpes 
viruses of the RFHV/KSHV subfamily. The polynucleotides, oligonucleotides, polypeptides and antibodies 
embodied in this invention are useful in the diagnosis, clinical monitoring, and treatment of herpes virus 

25 infections and related conditions. 

The source for the polynucleotide for the RFHV Glycoprotein B was affected tissue samples taken 
from Macaque nemestrina monkeys with retroperitoneal fibromatosis f RF~). The polynucleotide for the 
KSHV Glycoprotein B was obtained from affected tissue samples taken from humans with Kaposi's 
Sarcoma fKS*}. The tissues used for the present invention were known to contain genetic material from 

30 RFHV or KSHV, because they had previously been used successfully to done corresponding DNA 
Polymerase encoding fragments. The amplification of the DNA Polymerase regions have been described 
in commonly owned U.S. Patent Application 60/001 ,148. 

In order to amplify the Glycoprotein B sequences from these samples, we designed 
oligonucleotides from those of other herpes viruses. Glycoprotein B is expected to be less well conserved 

35 between herpes viruses, because it is externally exposed on the viral envelope and therefore under 
selective pressure from the immune system of the hosts they infect. Accordingly, the oligonucleotides 
were designed from sequences of hopes viruses believed to be most closely related to RFHV and KSHV. 
These two viruses are known from the DNA polymerase sequences to be closely related gamma type 
herpesviruses. 

9 
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Oligonucleotides were designed primarily from Glycoprotein B sequences previously known for four 
gamma herpes viruses: sHV1, eHV2, bHV4 ( mHV68 and hEBV. Comparison of the amino acid 
sequences of these four Glycoprotein B molecules revealed nine relatively conserved regions. Based on 
the sequence data, oligonucleotides were constructed comprising a degenerate segment and a consensus 
5 segment as described in a following section. Three of these oligonucleotides have been used as primers 
in amplification reactions that have yielded fragments of the RFHV and KSHV Glycoprotein B encoding 
segments from the RF and KS tissue. 

The RFHV and KSHV polynucleotide sequence fragments obtained after the final amplification step 
are shown in Figure 1 (SEQ. ID NO:1 and SEQ. ID NO:3, respectively). Included are segments at each 

10 end corresponding to the hybridizing regions of the NIVPA and TVNCB primers used in the amplification. 
The fragment between the primer binding segments is 319 base pairs in length (residues 36-354), and 
believed to be an accurate reflection of the sequences of the respective Glycoprotein B encoding regions 
of the RFHV and KSHV genomes. 

The 319 base pair Glycoprotein B encoding polynucleotide segment from RFHV is only 60% 

15 identical with that from sHV1 and bHV4, the most closely related sequences from outside the RFHV/KSHV 
subfamily. The 319 base pair polynucleotide segment from KSHV is only 63% identical with sHV1 and 
bHV4. The segments are 76% identical between RFHV and KSHV. 

Also shown are the corresponding predicted amino acid sequences (SEQ. ID NO:2 and SEQ. ID 
NO:4). The polypeptide sequences are novel, and are partly homologous to Glycoprotein B sequences 

20 from other herpes viruses. The fragments shown are predicted to be about 1/8 of the entire Glycoprotein B 
sequence. They begin about 80 amino acids downstream from the predicted N-terminal methionine of the 
pre-processed protein. There is a potential N-linked glycosylation site at position 80 of the amino acid 
sequence, according to the sequence Asn-Xaa-(Thr/Ser). This site is conserved between RFHV and 
KSHV, and is also conserved amongst other known gamma herpes viruses. There is also a cysteine 

25 residue at position 58 that is conserved across herpes viruses of the gamma, beta, and alpha subfamilies, 
which may play a role in maintaining the three-dimensional structure of the protein. 

The 106 amino acid segment of Glycoprotein B encoded by the 319 base pairs between the 
amplification primers is 91% identical between RFHV and KSHV, but only 65% identical between KSHV 
and that of bHV4, the closest sequence outside the RFHV/KSHV subfamily. 

30 Glycoprotein B molecules expressed by the RFHV/KSHV herpes virus subfamily are expected to 

have many of the properties described for Glycoprotein B of other herpes viruses. Glycoprotein B 
molecules are generally about 110 kDa in size, corresponding to about 800-900 amino acids or about 
2400-2700 base pairs. Hydrophobic^ plots indicate regions from the N terminus to the C terminus in the 
following order a hydrophobic region corresponding to a membrane-directing leader sequence; a mixed 

35 polarity region corresponding to an extracellular domain; a hydrophobic region corresponding to a 
transmembrane domain; and another mixed polarity region corresponding to a cytoplasmic domain. 

The full sequence of the KSHV Glycoprotein B, shown in Figure 19, confirms these predictions: 
The gene encodes about 845 amino acids including the signal peptide and a transmembrane region near 
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the C-terminus. Cysteine residues are conserved with other Glycoprotein B sequences, and an additional 
potential disulfide may help stabilize the three-dimensional structure. 

Glycoprotein B is generally expressed on the envelope of infectious and defective viral particles, 
and on the surface of infected cells. It is generally glycosylated, and may comprise 5-20 glycosylate 
5 sites or more. It is also generally expressed as a protein dimer, which assembles during translocation to 
the surface of the host cell, prior to budding of the virus. The site responsible for dimerization appears to 
be located between about amino acid 475 and the membrane spanning segment (Navarro et al.). 

Previous studies have mapped several biochemical functions related to infectivity to different 
regions of the Glycoprotein B molecule. Glycoprotein B and Glycoprotein C are both implicated in initial 
10 binding of HSV1 and bovine herpes virus 1 to target cells (Herald et al., Byrne el al.). The moiety on the 
cells recognized by Glycoprotein B appears to be heparan sulfate; the binding is inhibrtable by fluid-phase 
heparin. Mutants that lack Glycoprotein C can still bind target cells, but mutants that lack both 
Glycoprotein C and Glycoprotein B are severely impaired in their ability to gain access to the cells. 

Another apparently important function is the ability of Glycoprotein B to promote membrane fusion 
15 and entry of the vims into the cell. In human CMV, the fusogenic role appears to map to the first 
hydrophobic domain of Glycoprotein B, and may be associated with conserved glycine residues within this 
region (Reschke et al.). In HSV1 mutants, the ability of Glycoprotein B to promote syncytia formation 
maps to multiple sites in the cytoplasmic domain of the protein, near the C-terminus (Kostal et al.). 

In order to exercise some of these more complicated functions, it seems likely that Glycoprotein B 
20 associates not only with a second Glycoprotein B molecule, but with other components encoded by the 
virus. For example, the UL45 gene product appears to be required for Glycoprotein B induced fusion 
(Haanes et al.). It has been hypothesized that Glycoprotein B cooperates with other surface proteins to 
form a hydrophobic fusion pore in the surface of the target cell (Pereira et al.). Glycoprotein B has been 
found to elicit a potent antibody response capable of neutralizing the intact virus. Monoclonal antibodies 
25 with neutralizing activity may be directed against many different sites on the Glycoprotein B molecule. 

Consequently, it is expected that the Glycoprotein B molecule bears sites that interact with the 
target cell, help promote fusion, and associate with other viral proteins. It is predicted that Glycoprotein B 
molecules of RFHV/KSHV subfamily viruses will perform many of the functions of Glycoprotein B in other 
species of herpes virus, and bear active regions with some of the same properties. Interfering with any of 
30 these active regions with a drug, an antibody, or by mutation, may impair viral infectivity or virulence. 

Subsequent to discovery of the Glycoprotein B of RFHV and KSHV, a third member of the 
RFHV/KSHV subfamily was identified in a sample of affected tissue from a Macaca mufatta (Example 12). 
This Glycoprotein B is closely related but not identical to RFHV, and is designated RFHV2. It is predicted 
that other members of the RFHV/KSHV subfamily will emerge, including some that are pathogenic to 
35 humans. This disclosure teaches how new members of the subfamily can be detected and characterized. 

The homology between Glycoprotein B sequences within the RFHV/KSHV subfamily means that 
the polynucleotides and polypeptides embodied in this invention are reliable markers amongst different 
strains of the subfamily. The polynucleotides, polypeptides, and antibodies embodied in this invention are 
useful in such applications as the detection and treatment of viral infection in an individual, due to RFHV, 
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KSHV, or other herpes viruses in the same subfamily. The polynucleotides, oligonucleotide probes, 
polypeptides, antibodies, and vaccine compositions relating to Glycoprotein B, and the preparation and 
use of these compounds, is described in further detail in the sections that follow. 

Abbreviations 



The following abbreviations are used herein to refer to species of herpes viruses, and 
polynucleotides and polypeptides derived therefrom: 



TABLE 1: Abbre^attons for Herpes Vims Strains 


Designation 


-Vims ■■ . 


Provisional Subfamily 
Assignment 


PPUV/ 


simian Ketropentoneai rioromatosis- 
associated Herpesvirus 


gamma-HerpesVims 


KSHV 


human Kaposi's Sarcoma-associated 
Herpesvirus 




mHveo 


murine Herpesvirus 68 




bHV4 


bovine Herpesvirus 4 




eHV2 


equine Herpesvirus 2 




8HV1 


saimiri monkey Herpesvirus 1 




hEBV 


human Epstein-Barr Vims 




hCMV 


human Cytomegalovirus 


beta-HerpesVtrus 


mCMV 


murine CytoMegaloVims 




gpCMV 


guinea pig CytoMegaloVims 




hHVB 


human Herpesvirus 6 




hVZV 


human Varicella-Zoster Vims 


alpha-HerpesVtms 


HSV1 


human Herpes Simplex Vims 1 




HSV2 


human Herpes Simplex Vims 2 




' 8HVSA8 


simian Herpesvirus A8 




«HV1 


equine Herpesvirus 1 




JHV1 


ictalurid catfish Herpesvirus 





General Definitions 



"Glycoprotein B" is a particular protein component of a herpes vims, encoded in the viral genome 
and believed to be expressed at the surface of the intact vims. Functional studies with certain species of 
herpes vims, especially HSV1, hCMV, and bovine herpes vims 1, have implicated Glycoprotein B in a 
number of biochemical functions related to viral infecthnty. These include binding to components on the 
surface of target cells, such as heparan sulfate, fusion of the viral membrane with the membrane of the 
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target cell, penetration of the viral capsid Into the ceil, and formation of polynudeated syncytial ceils. 
Glycoprotein B has been observed as a homodimer, and may interact with other viral surface proteins in 
order to exert some of its biochemical functions. Different biochemical functions, particularly heparan 
sulfate binding and membrane fusion, appear to map to different parts of the Glycoprotein B molecule. A 
5 Glycoprotein B molecule of other herpes viruses, including members of the RFHV/KSHV subfamily, may 
perform any or all of these functions. As used herein, the term Glycoprotein B includes ungtycosylated, 
partly glycosylated, and folly glycosylated forms, and both monomers and polymers. 

As used herein, a Glycoprotein B fragment, region, or segment is a fragment of the Glycoprotein B 
molecule, or a transcript of a subregion of a Glycoprotein B encoding polynucleotide. The intact 
10 Glycoprotein B molecule, or the full-length transcript, will exert biochemical functions related to viral 
activity, such as those described above. Some or all of these functions may be preserved on the fragment, 
or the fragment may be from a part of the intact molecule which is unable to perform these functions on its 
own. 

"Glycoprotein B activity" refers to any biochemical function of Glycoprotein B, or any biological 
1 5 activity of a herpes virus attributable to Glycoprotein B. These may include but are not limited to binding of 
the protein to cells, cell receptors such as heparan sulfate, and receptor analogs; viral binding or 
penetration into a cell, or cell fusion. 

The term "Glycoprotein B gene" refers to a gene comprising a sequence that encodes a 
Glycoprotein B molecule as defined above. It is understood that a Glycoprotein B gene may give rise to 
20 processed and altered translation products, including but not limited to forms of Glycoprotein B with or 
without a signal or leader sequence, truncated or internally deleted forms, muttimeric forms, and forms with 
different degrees of glycosylation. 

As used herein, a "DNA Polymerase* is a protein or a protein analog, that under appropriate 
conditions is capable of catalyzing the assembly of a DNA polynucleotide with a sequence that is 
25 complementary to a polynucleotide used as a template. A DNA Polymerase may also have other catalytic 
activities, such as 3 -5' exonuclease activity; any of the activities may predominate. A DNA Polymerase 
may require association with additional proteins or co-factors in order to exercise its catalytic function. 

"RFHV is a virus of the herpes family detected in the tissue samples of Macaque nemestrina 
monkeys affected with Retroperitoneal Fibromatosis (RF). RFHV j^synonymous-wtth the terms "RFHV1", 
30 "RFHVMn ", and "RFMn*. "KSHV is a virus of the herpes virus family detected in the tissue samples of 
humans affected with Kaposi' s Sarcoma (K S). A third member of the RFHV/KSHV subfamily is a virus 
identified in a M. mulatta monkey. The virus is referred to herein ,as^RFHV2"r~*RFHV2* is synonymous 

with tfe terms "RFHVMm-arxi-RFMmV — — 

The "RFHV/KSHV subfamily" is a term used herein to refer to a collection of herpes viruses capable 
35 of infecting vertebrate species. The subfamily consists of members that have Glycoprotein B sequences 
that are more closely related to that of the corresponding sequences of RFHV or KSHV than other herpes 
viruses, including sHV1, eHV2, bHV4, mHVBB and hEBV. Preferably, the polynucleotide encoding 
Glycoprotein B comprises a segment that is at least 65% identical to that of RFHV (SEQ. ID NO:1) or 
KSHV (SEQ. ID NO:3) between residues 36 and 354; or at least about 74% identical to the oligonucleotide 
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SHMDA, or at least about 73% identical to the oligonucleotide CFSSB, or at least about 72% identical to 
the nucleotide ENTFA, or at least about 60% identical to the nucleotide DNIQB. RFHV and KSHV are 
exemplary members of the RFHV/KSHV subfamily. The RFHV/KSHV subfamily represents a subset of 
the gamma subfamily of herpes viruses. 

The terms -polynucleotide" and "oligonucleotide" are used interchangeably, and refer to a polymeric 
form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. 
Polynucleotides may have any three-dimensional structure, and may perform any function, known or 
unknown. The following are non-limiting examples of polynucleotides: a gene or gene fragment, exons, 
introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, cDNA, recombinant 
polynucleotides, branched polynucleotides, plasmtds, vectors, isolated DNIA of any sequence, isolated 
RNA of any sequence, nucleic acid probes, and primers. A polynucleotide may comprise modified 
nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the 
nucleotide structure may be imparted before or after assembly of the polymer. The sequence of 
nucleotides may be interrupted by norvnudeotkte components. A polynucleotide may be further modified 
after polymerization, such as by conjugation with a labeling component. 

The term polynucleotide, as used herein, refers to both double- and single-stranded molecules. 
Unless otherwise specified or required, any embodiment of the invention described herein that is a 
polynucleotide encompasses both the double-stranded form and each of two. complementary 
single-stranded forms known or predicted to make up the double-stranded form. 

In the context of polynucleotides, a "linear sequence" or a "sequence" is an order of nucleotides in 
a polynucleotide in a 5* to 3' direction in which residues that neighbor each other in the sequence are 
contiguous in the primary structure of the polynucleotide. A "partial sequence" is a linear sequence of part 
of a polynucleotide which is known to comprise additional residues in one or both directions. 

"Hybridization" refers to a reaction in which one or more polynucleotides react to form a complex 
that is stabilized via hydrogen bonding between the bases of the nucleotide residues. The hydrogen 
bonding may occur by \AfatsorvCrick base pairing, Hoogsteen binding, or in any other sequence-specific 
manner. The complex may comprise two strands forming a duplex structure, three or more strands 
forming a multi-stranded complex, a single self-hybridizing strand, or any combination of these. A 
hybridization reaction may constitute a step in a more extensive process, such as the initiation of a PGR, 
or the enzymatic cleavage of a polynucleotide by a ribozyme. 

Hybridization reactions can be performed under conditions of different "stringency". Conditions that 
increase the stringency of a hybridization reaction are widely known and published in the art: see, for 
example, Sambrook Fritsch & Maniatis. Examples of relevant conditions include (in order of increasing 
stringency): incubation temperatures of 25*C, 37*C, 50*C, and 68*C; buffer concentrations of 10 x SSC, 6 
. x SSC. 1 x SSC, 0.1 x SSC (where SSC is 0.15 M NaCI and 15 mM citrate buffer) and their equivalent 
using other buffer systems; formamide concentrations of 0%, 25%, 50%, and 75%; incubation times from 5 
min to 24 h; 1 , 2, or more washing steps; wash incubation times of 1 , 5, or 1 5 min; and wash solutions of 6 
x SSC, 1 x SSC, 0.1 x SSC, or deionized water. 
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T m " is the temperature in degrees Centigrade at which 50% of a polynucleotide duplex made of 
complementary strands hydrogen bonded in an anttparallel direction by Watson-Crick base paring 
dissociates into single strands under the conditions of the experiment T m may be predicted according to 
standard formula; for example: 

5 

T m = 81.5 + 16.6 log (Na 4 ] + 0.41 (%G/C) - 0.61 (%F) - 600/L 

where [Na 4 ] is the cation concentration (usually sodium ion) in mol/L; (%G/C) is the number of G and C 
residues as a percentage of total residues in the duplex; (%F) is the percent formamide in solution (wt/vol); 

10 and L is the number of nucleotides in each strand of the duplex. 

A "stable duplex" of polynucleotides, or a 'stable complex" formed between any two or more 
components in a biochemical reaction, refers to a duplex or complex that is sufficiently long-lasting to 
persist between the formation of the duplex or complex, and its subsequent detection. The duplex or 
complex must be able to withstand whatever conditions exist or are introduced between the moment of 

15 formation and the moment of detection, these conditions being a function of the assay or reaction which is 
being performed. Intervening conditions which may optionally be present and which may dislodge a 
duplex or complex include washing, heating, adding additional solutes or solvents to the reaction mixture 
(such as denaturants), and competing with additional reacting species. Stable duplexes or complexes 
may be irreversible or reversible, but must meet the other requirements of this definition. Thus, a transient 

20 complex may form in a reaction mixture, but it does not constitute a stable complex if it dissociates 
spontaneously or as a result of a newly imposed condition or manipulation introduced before detection. 

When stable duplexes form in an antiparallel configuration between two single-stranded 
polynucleotides, particularly under conditions of high stringency, the strands are essentially 
"complementary". A double-stranded polynucleotide can be "comple-mentary" to another polynucleotide, if 

25 a stable duplex can form between one of the strands of the first polynucleotide and the second. A 
complementary sequence predicted from the sequence of a single stranded polynucleotide is the optimum 
sequence of standard nucleotides expected to form hydrogen bonding with the single-stranded 
polynucleotide according to generally accepted base-pairing rules. 

A "sense" strand and an "antisense" strand when used in the same context refer to single-stranded 

30 polynucleotides which are complementary to each other. They may be opposing strands of a 
double-stranded polynucleotide, or one strand may be predicted from the other according to generally 
accepted base-pairing rules. If not specified, the assignment of one or the other strand as "sense" or 
"antisense" may be arbitrary. In relation to a polypeptide-encoding segment of a polynucleotide, the 
"sense* strand is generally the strand comprising the encoding segment. 

35 When comparison is made between polynucleotides for degree of identity, it is implicitly understood 

that complementary strands are easily generated, and the sense or antisense strand is selected or 
predicted that maximizes the degree of Identity between the polynucleotides being compared. For 
example, where one or both of the polynucleotides being compared is double-stranded, the sequences are 
identical if one strand of the first polynucleotide is identical with one strand of the second polynucleotide. 
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Similarly, when a polynucleotide probe is described as identical to its target, it is understood that it is the 
complementary strand of the target that participates in the hybridization reaction between the probe and 
the target 

A linear sequence of nucleotides is "essentially identical" to another linear sequence, if both 
5 sequences are capable of hybridizing to form duplexes with the same complementary polynucleotide. 
Sequences that hybridize under conditions of greater stringency are more preferred. It is understood that 
hybridization reactions can accommodate insertions, deletions, and substitutions in the nucleotide 
sequence. Thus, linear sequences of nucleotides can be essentially identical even if some of the 
nucleotide residues do not precisely correspond or align. Sequences that correspond or align more closely 

10 to the invention disclosed herein are comparably more preferred. Generally, a polynucleotide region of 
about 25 residues is essentially identical to another region, if the sequences are at least about 85% 
identical; more preferably, they are at least about 90% identical; more preferably, they are at least about 
95% identical; still more preferably, the sequences are 100% identical. A polynucleotide region of 40 
residues or more will be essentially identical to another region, after alignment of homologous portions if 

15 the sequences are at least about 75% identical; more preferably, they are at least about 80% identical; 
more preferably, they are at least about 85% identical; even more preferably, they are at least about 90% 
identical; still more preferably, the sequences are 100% identical. 

In determining whether polynucleotide sequences are essentially identical, a sequence that 
preserves the functionality of the polynucleotide with which it is being compared is particularly preferred. 

20 Functionality can be determined by different parameters. For example, if the polynucleotide is to be used 
in reactions that involve hybridizing with another polynucleotide, then preferred sequences are those which 
hybridize to the same target under similar conditions. In general, the T m of a DNA duplex decreases by 
about 1*C for every 1% decrease in sequence identity for duplexes of 200 or more residues; or by about 
5*C for duplexes of less than 40 residues, depending on the position of the mismatched residues (see, 

25 e.g., Meinkoth et al.). Essentially identical sequences of about 100 residues will generally form a stable 
duplex with each other's respective complementary sequence at about 20*C less than T m ; preferably, they 
will form a stable duplex at about 15*C less; more preferably, they will form a stable duplex at about 10*C 
less; even more preferably, they will form a stable duplex at about 5*C less; still more preferably, they will 
form a stable duplex at about T m . In another example, if the polypeptide encoded by the polynucleotide is 

30 an important part of its functionality, then preferred sequences are those which encode identical or 
essentially identical polypeptides. Thus, nucleotide differences which cause a conservative amino acid 
substitution are preferred over those which cause a non-conservative substitution, nucleotide differences 
which do not alter the amino acid sequence are more preferred, white identical nucleotides are even more 
preferred. Insertions or deletions in the polynucleotide that result in insertions or deletions in the 

35 polypeptide are preferred over those that result in the down-stream coding region being rendered out of 
phase; polynucleotide sequences comprising no insertions or deletions are even more preferred. The 
relative importance of hybridization properties and the encoded polypeptide sequence of a polynucleotide 
depends on the application of the invention. 
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A polynucleotide has the same "characteristics" of another polynucleotide if both are capable of 
forming a stable duplex with a particular third polynucleotide under similar conditions of maximal 
stringency. Preferably, in addition to similar hybridization properties, the polynucleotides also encode 
essentially identical polypeptides. 

•Conserved" residues of a polynucleotide sequence are those residues which occur unaltered in the 
same position of two or more related sequences being compared. Residues that are relatively conserved 
are those that are conserved amongst more related sequences or with a greater degree of identity than 
residues appearing elsewhere in the sequences. 

"Related" polynucleotides are polynucleotides that share a significant proportion of identical 
residues. 

As used herein, a "degenerate" oligonucleotide sequence is a designed sequence derived from at 
least two related originating polynucleotide sequences as follows: the residues that are conserved in the 
originating sequences are preserved in the degenerate sequence, while residues that are not conserved in 
the originating sequences may be provided as several alternatives in the degenerate sequence. For 
example, the degenerate sequence AYASA may be designed from originating sequences ATACA and 
ACAGA. where Y is C or T and S is C or G. Y and S are examples of "ambiguous" residues. A 
degenerate segment is a segment of a polynucleotide containing a degenerate sequence. 

It is understood that a synthetic oligonucleotide comprising a degenerate sequence is actually a 
mixture of closely related oligonucleotides sharing an identical sequence, except at the ambiguous 
positions. Such an oligonucleotide is usually synthesized as a mixture of alt possible combinations of 
nucleotides at the ambiguous positions. Each of the oligonucleotides in the mixture is referred to as an 
"alternative form". The number of forms in the mixture is equal to 

n 

n ki 

where ^ is the number of alternative nucleotides allowed at each position. 

As used herein, a "consensus" oligonucleotide sequence is a designed sequence derived from at 
least two related originating polynucleotide sequences as follows: the residues that are conserved in all 
originating sequences are preserved in the consensus sequence; while at positions where residues are not 
conserved, one alternative is chosen from amongst the originating sequences. In general, the nucleotide 
chosen is the one which occurs in the greatest frequency In the originating sequences. For example, the 
consensus sequence AAAAA may be designed from originating sequences CAAAA, AAGAA. and AAAAT. 
A consensus segment is a segment of a polynucleotide containing a consensus sequence. 

A polynucleotide fragment" or "insert" as used herein generally represents a sub-region of the 
full-length form, but the entire fulHength polynucleotide may also be included. 

Polynucleotides "correspond" to each other if they are believed to be derived from each other or 
from a common ancestor. For example, encoding regions in the genes of different viruses correspond if 
they share a significant degree of identity, map to the same location of the genome, or encode proteins 
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that perform a similar biochemical function. Messenger RNA corresponds to the gene from which it is 
transcribed. cDNA corresponds to the RNA from which it has been produced, and to the gene that 
encodes the RNA. A protein corresponds to a polynucleotide encoding it and to an antibody that is 
capable of binding it specifically. 
5 A "probe" when used in the context of polynucleotide manipulation refers to an oligonucleotide 

which is provided as a reagent to detect a target potentially present in a sample of interest by hybridizing 
with the target Usually, a probe will comprise a label or a means by which a label can be attached, either 
before or subsequent to the hybridization reaction. Suitable labels include, but are not limited to 
radioisotopes, fluorochromes. chemiluminescent compounds, dyes, and proteins, including enzymes. 

10 A "primer" is an oligonucleotide, generally with a free 3* -OH group, that binds to a target 

potentially present in a sample of interest by hybridizing with the target and thereafter promotes 
polymerization of a polynucleotide complementary to the target. 

Processes of producing replicate copies of the same polynucleotide, such as PCR or gene 
cloning, are collectively referred to herein as "amplification" or "replication". For example, single or double- 

1 5 stranded DNA may be replicated to form another DNA with the same sequence. RNA may be replicated, 
for example, by an RNA-directed RNA polymerase, or by reverse-transcribing the DNA and then 
performing a PCR. In the latter case, the amplified copy of the RNA is a DNA with the identical sequence. 

A "polymerase chain reaction" ("PCR") is a reaction in which replicate copies are made of a target 
polynucleotide using one or more primers, and a catalyst of polymerization, such as a reverse 

20 transcriptase or a DNA polymerase, and particularly a thermally stable polymerase enzyme. Generally, a 
PCR involves rerteratively performing three steps: "annealing", in which the temperature is adjusted such 
that oligonucleotide primers are permitted to form a duplex with the polynucleotide to be amplified; 
"elongating", in which the temperature is adjusted such that oligonucleotides that have formed a duplex are 
elongated with a DNA polymerase, using the polynucleotide to which they've formed the duplex as a 

25 template; and "melting", in which the temperature is adjusted such that the polynucleotide and elongated 
oligonucleotides dissociate. The cycle is then repeated until the desired amount of amplified 
polynucleotide is obtained. Methods for PCR are taught in U.S. Patent Nos. 4,683,195 (Mulfis) and 
4,683,202 (Multiset al.). 

A "control element* or "control sequence" is a nucleotide sequence involved in an interaction of 
30 molecules that contributes to the functional regulation of a polynucleotide, including replication, duplication, 
transcription, splicing, translation, or degradation of the polynucleotide. The regulation may affect the 
frequency, speed, or specificity of the process, and may be enhancing or inhibitory in nature. Control 
elements are known in the art For example, a "promoter" is an example of a control element A promoter 
is a DNA region capable under certain conditions of rinding RNA polymerase and initiating transcription of 
35 a coding region located downstream (in the 3' direction) from the promoter. "Operatively linked" refers 
to a juxtaposition of genetic elements, wherein the elements are in a relationship permitting them to 
operate in the expected manner. For instance, a promoter is operatively linked to a coding region if the 
promoter helps initiate transcription of the coding sequence. There may be intervening residues between 
the promoter and coding region so long as this functional relationship is maintained. 
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The terms "polypeptide" , 'peptide' and •protein" are used interchangeably herein to refer to 
polymers of amino acids of any length. The polymer may be linear or branched, it may comprise modified 
amino adds, and it may be interrupted by non-amino acids. The terms also encompass an amino acid 
polymer that has been modified naturally or by intervention; for example, disulfide bond formation, 
5 gtycosytation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with 
a labeling component 

In the context of polypeptides, a "linear sequence" or a "sequence* is an order of amino acids in a 
polypeptide in an N-terminal to C-terminal direction in which residues that neighbor each other in the 
sequence are contiguous in the primary structure of the polypeptide. A "partial sequence" is a linear 

10 sequence of part of a polypeptide which is known to comprise additional residues in one or both directions. 

A linear sequence of amino acids is "essentially identical" to another sequence rf the two 
sequences have a substantial degree of sequence identity. It is understood that the folding and the 
biochemical function of proteins can accommodate insertions, deletions, and substitutions in the amino 
acid sequence. Thus, linear sequences of amino adds can be essentially identical even rf some of the 

15 residues do not predsely correspond or align. Sequences that correspond or align more dosely to the 
invention disdosed herein are more preferred. It is also understood that some amino acid substitutions are 
more easily tolerated. For example, substitution of an amino acid with hydrophobic side chains, aromatic 
side chains, polar side chains, side chains with a positive or negative charge, or side chains comprising 
two or fewer carbon atoms, by another amino acid with a side chain of like properties can occur without 

20 disturbing the essential identity of the two sequences. Methods for determining homologous regions and 
scoring the degree of homology are well known in the art see for example Altschul et al. and Henikoff et al. 
Well-tolerated sequence differences are referred to as "conservative substitutions". Thus, sequences with 
conservative substitutions are preferred over those with other substitutions in the same positions; 
sequences with identical residues at the same positions are still more preferred. Generally, a 

25 polypeptide region will be essentially identical to another region, after alignment of homologous portions, rf 
the sequences are at least about 92% identical; more preferably, they are at least about 95% identical; 
more preferably, they are at least about 95% identical and comprise at least another 2% which are either 
identical or are conservative substitutions; more preferably, they are at least about 97% identical; more 
preferably, they are at least about 97% identical, and comprise at least another 2% which are either 

30 identical or are conservative substitutions: more preferably, they are at least about 99% identical; still more 
preferably, the sequences are 100% identical. 

In determining whether polypeptide sequences are essentially identical, a sequence that 
preserves the functionality of the polypeptide with which it is being compared is particularly preferred. 
Functionality may be established by different parameters, such as enzymatic activity, the binding rate or 

35 affinity in a substrate-enzyme or receptor-ligand interaction, the binding affinity with an antibody, and X-ray 
crystallographic structure. 

A polypeptide has the same "characteristics" of another polypeptide if it displays the same 
biochemical function, such as enzyme activity, tigand binding, or antibody reactivity. * Preferred 
characteristics of a polypeptide related to a Glycoprotein B or a Glycoprotein B fragment are the ability to 
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bind analogs of the cell surface receptor bound by Glycoprotein B of other herpes species, the ability to 
promote membrane fusion with a target cell, the ability to promote viral penetration of the host cell. Alsc 
preferred is a polypeptide that displays the same biochemical function as the polypeptide with which it is 
being compared, and in addition, is believed to have a similar three-dimensional conformation, as 
5 predicted by computer modeling or determined by such techniques as X-ray crystallography. 

The -biochemical function', "biological function' or "biological activity" of a polypeptide includes 
any feature of the polypeptide detectable by suitable experimental investigation. "Altered" biochemical 
function can refer to a change in the primary, secondary, tertiary, or quaternary structure of the 
polypeptide; detectable, for example, by molecular weight determination, circular dichroism, antibody 

10 binding, difference spectroscopy, or nuclear magnetic resonance. It can also refer to a change in 
reactivity, such as the ability to catalyze a certain reaction, or the ability to tend a cofactor, substrate, 
inhibitor, drug, hapten, or other polypeptide. A substance may be said to "interfere" with the biochemical 
function of a polypeptide if it alters the biochemical function of the polypeptide in any of these ways. 

A fusion polypeptide" is a polypeptide comprising regions in a different position in the sequence 

15 than occurs in nature. The regions may normally exist in separate proteins and are brought together in the 
fusion polypeptide; or they may normally exist in the same protein but are placed in a new arrangement in 
the fusion polypeptide. A fusion polypeptide may be created, for example, by chemical synthesis, or by 
creating and translating a polynucleotide in which the peptide regions are encoded in the desired 
relationship. 

20 An "antibody" (interchangeably used in plural form) is an immunoglobulin molecule capable of 

specific binding to a target, such as a polypeptide, through at least one antigen recognition site, located in 
the variable region of the immunoglobulin molecule. As used herein, the term encompasses not only intact 
antibodies, but also fragments thereof, mutants thereof, fusion proteins, humanized antibodies, and any 
other modified configuration of the immunoglobulin molecule that comprises an antigen recognition site of 

25 the required specificity. 

"Immunological recognition" or "immunological reactivity" refers to the specific binding of a target 
through at least one antigen recognition site in an immunoglobulin or a related molecule, such as a B cell 
receptor or a T cell receptor. 

The term "antigen" refers to the target molecule that is specifically bound by an antibody through 

30 its antigen recognition site. The antigen may, but need not be chemically related to the immunogen that 
stimulated production of the antibody. The antigen may be polyvalent, or it may be a monovalent hapten. 
Examples of kinds of antigens that can be recognized by antibodies include polypeptides, polynucleotides, 
other antibody molecules, oligosaccharides, complex lipids, drags, and chemicals. 

An "immunogen" is a compound capable of stimulating production of an antibody when injected 

35 into a suitable host, usually a mammal. Compounds with this property are described as "immunogenic". 
Compounds may be rendered immunogenic by many techniques known in the art including crosslinking or 
conjugating with a carrier to increase valency, mixing with a mitogen to increase the immune response, 
and combining with an adjuvant to enhance presentation. 
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A "vaccine" is a pharmaceutical preparation for human or animal use, which is administered with 
the intention of conferring the recipient with a degree of specific immunological reactivity against a 
particular target or group of targets. The immunological reactivity may be antibodies or cells (particularly 
B ceils, plasma cells, T helper cells, and cytotoxic T lymphocytes, and their precursors) that are 
5 immunologically reactive against the target, or any combination thereof. Possible targets include foreign or 
pathological compounds, such as an exogenous protein, a pathogenic virus, or an antigen expressed by a 
cancer cell. The immunological reactivity may be desired for experimental purposes, for the treatment of a 
particular condition, for the elimination of a particular substance, or for prophylaxis against a particular 
condition or substance. Unless specifically indicated, a vaccine referred to herein may be either a passive 

1 0 vaccine or an active vaccine, or it may have the properties of both. 

A "passive vaccine" is a vaccine that does not require participation of the recipient's immune 
response to exert its effect Usually, it is comprised of antibody molecules reactive against the target. The 
antibodies may be obtained from a donor subject and sufficiently purified for administration to the recipient 
or they may be produced in vitro, for example, from a culture of hybridoma cells, or by genetically 

1 5 engineering a polynucleotide encoding an antibody molecule. 

An "active vaccine" is a vaccine administered with the intention of eliciting a specific immune * 
response within the recipient that in turn has the desired immunological reactivity against the target An 
active vaccine comprises a suitable immunogen. The immune response that is desired may be either 
humoral or cellular, systemic or secretory, or any combination of these. 

20 A "reagent" polynucleotide, polypeptide, or antibody, is a substance provided for a reaction, the 

substance having some known and desirable parameters for the reaction. 

A reaction mixture may also contain a "target", such as a polynucleotide, antibody, or polypeptide 
that the reagent is capable of reacting with. For example, in some types of diagnostic tests, the amount of 
the target in a sample is determined by adding a reagent, allowing the reagent and target to react, and 

25 measuring the amount of reaction product. In the context of clinical management a target may also be a 
cell, collection of cells, tissue, or organ that is the object of an administered substance, such as a 
pharmaceutical compound. A cell that is a target for a viral infection is one to which a virus preferentially 
localizes for such purposes as replication or transformation into a latent form. 

An "isolated" polynucleotide, polypeptide, protein, antibody, or other substance refers to a 

30 preparation of the substance devoid of at least some of the other components that may also be present 
where the substance or a similar substance naturally occurs or is initially obtained from. Thus, for example, 
an isolated substance may be prepared by using a purification technique to enrich it from a source mixture. 
Enrichment can be measured on an absolute basis, such as weight per volume of solution, or it can be 
measured in relation to a second, potentially interfering substance present in the source mixture. 

35 Increasing enrichments of the embodiments of this invention are increasingly more preferred. Thus, for 
example, a 2-fold enrichment is preferred, 10-fold enrichment is more preferred, 10f>fold enrichment is 
more preferred, 1000-fold enrichment is even more preferred. A substance can also be provided in an 
isolated state by a process of artificial assembly, such as by chemical synthesis or recombinant 
expression. 
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A polynucleotide used in a reaction, such as a probe used in a hybridization reaction, a primer 
used in a PGR, or a polynucleotide present in a pharmaceutical preparation, is referred to as •specific" or 
■selective" if it hybridizes or reacts with the intended target more frequently, more rapidly, or with greater 
duration than it does with alternative substances. Similarly, a polypeptide is referred to as "specific" or 
5 "selective - if it binds an intended target, such as a ligand, hapten, substrate, antibody, or other polypeptide 
more frequently, more rapidly, or with greater duration than it does to alternative substances. An antibody 
is referred to as "specific" or "selective' if it binds via at least one antigen recognition site to the intended 
target more frequently, more rapidly, or with greater duration than it does to alternative substances. A 
polynucleotide, polypeptide, or antibody is said to "selectively inhibit" or 'selectively interfere with" a 

1 0 reaction if it inhibits or interferes with the reaction between particular substrates to a greater degree or for a 
greater duration than it does with the reaction between alternative substrates. 

A "pharmaceutical candidate" or "Aug candidate" is a compound believed to have therapeutic 
potential, that is to be tested for efficacy. The "screening" of a pharmaceutical candidate refers to 
conducting an assay that is capable of evaluating the efficacy and/or specificity of the candidate. In this 

15 context, "efficacy" refers to the ability of the candidate to affect the cell or organism it is administered to in a 
beneficial way: for example, the limitation of the pathology due to an invasive vims. 

The "effector component" of a pharmaceutical preparation is a component which modifies target 
cells by altering their function in a desirable way when administered to a subject bearing the cells. Some 
advanced pharmaceutical preparations also have a "targeting component", such as an antibody, which 

20 helps deliver the effector component more efficaciously to the target site. Depending on the desired action, 
the effector component may have any one of a number of modes of action. For example, it may restore or 
enhance a normal function of a cell, it may eliminate or suppress an abnormal function of a cell, or it may 
alter a cell's phenotype. Alternatively, it may kill or render dormant a cell with pathological features, such 
as a virally infected cell. Examples of effector components are provided in a later section. 

25 A "cell line" or "cell culture" denotes higher eukaryotic cells grown or maintained in vitro. It is 

understood that the descendants of a cell may not be completely identical (either morphologically, 
genotypically, or phenotypically) to the parent cell. 

A "host cell" is a cell which has been transformed, or is capable of being transformed, by 
administration of an exogenous polynucleotide. A "host cell" includes progeny of the original transformant 

30 "Genetic alteration" refers to a process wherein a genetic element is introduced into a cell other 

than by natural cell division. The element may be heterologous to the cell, or rt may be an additional copy 
or improved version of an element already present in the cell. Genetic alteration may be effected, for 
example, by transfecting a cell with a recombinant plasmid or other polynucleotide through any process 
known in the art, such as electroporation, calcium phosphate precipitation, contacting with a 

35 polynucleotide-liposome complex, or by transduction or infection with a DNA or RNA virus or viral vector. 
The alteration is preferably but not necessarily inheritable by progeny of the altered cell. 

An "individual" refers to vertebrates, particularly members of a mammalian species, and includes 
but is not limited to domestic animals, sports animals, arid primates, including humans. 
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The term "primate" as used herein refers to any member of the highest order of mammalian 
species. This includes (but is not limited to) prosimians, such as lemurs and forises; tarsioids, such as 
tarsiers; new-world monkeys, such as squirrel monkeys (Saimiri sciureus) and tamarins; old-world 
monkeys such as macaques (including Macaca nemestrina, Macaca fasocufahs, and Macaca fuscata); 
5 hylobatids, such as gibbons and stamangs; pongids. such as orangutans, gorillas, and chimpanzees; and 
hominids, including humans. 

The "pathology" caused by a herpes virus infection is anything that compromises the well-being or 
normal physiology of the host This may involve (but is not limited to) destructive invasion of the vims into 
previously uninfected cells, replication of the virus at the expense of the normal metabolism of the cell, 
10 generation of toxins or other unnatural molecules by the virus, irregular growth of cells or intercellular 
structures (including fibrosis), irregular or suppressed biological activity of infected cells, malignant 
transformation, interference with the normal function of neighboring cells, aggravation or suppression of an 
inflammatory or immunological response, and increased susceptibility to other pathogenic organisms and 
conditions. 

15 Treatment" of an individual or a cell is any type of intervention in an attempt to alter the natural 

course of the individual or cell. For example, treatment of an individual may be undertaken to decrease or 
limit the pathology caused by a herpes vims infecting the individual. Treatment includes (but is not limited 
to) administration of a composition, such as a pharmaceutical composition, and may be performed either 
prophyfactically, or therapeutically, subsequent to the initiation of a pathologic event or contact with an 

20 etiologic agent 

It is understood that a clinical or biological "sample" encompasses a variety of sample types 
obtained from a subject and useful in an in vitro procedure, such as a diagnostic test The definition 
encompasses solid tissue samples obtained as a surgical removal, a pathology specimen, or a biopsy 
specimen, tissue cultures or cells derived therefrom and the progeny thereof, and sections or smears 

25 prepared from any of these sources. Non-limiting examples are samples obtained from infected sites, 
fibrotic sites, unaffected sites, and tumors. The definition also encompasses blood, spinal fluid, and other 
liquid samples of biologic origin, and may refer to either the cells or cell fragments suspended therein, or to 
the liquid medium and its solutes. The definition also includes samples that have been soiubilized or 
enriched for certain components, such as DNA, RNA, protein, or antibody. 

30 Oligonucleotide primers and probes described herein have been named as follows: The first part 

of the designation is the single amino acid code for a portion of the conserved region of the polypeptide 
they are based upon, usually 4 residues long. This is followed with the letter A or B, indicating respectively 
that the oligonucleotide is complementary to the sense or anti-sense strand of the encoding region. 
Secondary consensus oligonucleotides used for sequencing and labeling reactions have the letters SQ at 

35 the end of the designation. 
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General techniques 

The practice of the present invention will employ, unless otherwise indicated, conventional 
techniques of molecular biology, microbiology, recombinant DNA, and immunology, which are within the 
5 skill of the art Such techniques are explained fully in the literature. See. for example, "Molecular Cloning: 
A Laboratory Manual", Second Edition (Sambrook, Fritsch & Maniatis, 1989), "Oligonucleotide Synthesis" 
(M.J. Gait. ed. t 1984), "Animal Cell Culture" (R.I. Freshney, ed. t 1987); the series "Methods in Enzymology" 
(Academic Press, Inc.); "Handbook of Experimental Immunology" (D.M. Weir & C.C. Blackwell, eds.), 
"Gene Transfer Vectors for Mammalian Cells" (J.M. Miller & M P. Calos, eds., 1987), "Current Protocols in 
10 Molecular Biology" (F.M. Ausubel et al., eds., 1987); and "Current Protocols in Immunology" (J.E Coligan 
etal., eds., 1991). 

All patents, patent applications, articles and publications mentioned herein, both supra and infra, 
are hereby incorporated herein by reference. 

1 5 Polynucleotides encoding Glycoprotein B of the herpes virus RFHV/KSHV subfamffy 

This invention embodies isolated polynucleotide segments derived from Glycoprotein B genes 
present in herpes viruses that encode a fragment of a Glycoprotein B polypeptide. The polynucleotides 
are related to the RFHV/KSHV subfamily of herpes viruses. Exemplary polynucleotides encode 
20 Glycoprotein B fragments from either RFHV or KSHV. Preferred fragments include those shewn in Figure 
1, and subfragments thereof, obtained as described in the Example section below. Especially preferred is 
] the polynucleotide comprising the sequence between residues 36-354 of SEQ. ID NO:1 , SEQ. ID NO:3, or 
SEQ. ID NO:96, or polynucleotides contained in SEQ. ID NO:92. 

The polynucleotide segments of RFHV and KSHV between residues 36 and 354 are 76% identical. 
25 Shared residues are indicated in Figure 1 by "*". The longest subregions that are identically shared 
between RFHV and KSHV within this segment are 15, 17, and 20 nucleotides in length. 

The 31 9 base pair fragments of RFHV and KSHV between the amplification primer binding sites are 
more identical to each other than either of them are to that of any previously sequenced herpes virus. The 
next most closely related sequences are sHV1 and bHV4, which are 63% identical to the corresponding 
30 sequence of KSHV, and 60% identical to the corresponding sequence of RFHV. The longest number of 
consecutive bases shared between the Glycoprotein B fragment and any of the previously sequenced 
viruses is 14. It is believed that any subfragment of the RFHV or KSHV sequence of 16 base pairs or 
longer will be unique to the RFHV/KSHV subfamily, or to particular herpes virus species and variants 
within the subfamily. 

35 This invention embodies subfragments contained in the Glycoprotein B gene of the RFHV/KSHV 

subfamily, preferably contained in the region corresponding to the 319 base pair fragment between 
residues 36-354 of SEQ. ID NO:1, SEQ. ID NO:3, or SEQ. ID NO:96, or anywhere in SEQ. ID NO:92. 
Preferably, the sub-fragments are at least about 16 nucleotides in length; more preferably they are at least 
18 nucleotides in length; more preferably they are at least 21 nucleotides in length; more preferably they 
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are at least about 25 nucleotides in length; more preferably they are at least about 35 nucleotides in length; 
still more preferably they are at least about 50 nucleotides in length; yet more preferably they are at least 
about 75 nucleotides in length, and even more preferably they are 100 nucleotides in length or more. Also 
embodied in this invention are polynucleotides comprising the entire open reading frame of each 
5 respective herpes virus Glycoprotein B. 

The RFHV/KSHV subfamily consists of members that have sequences that are more closely 
identical to the corresponding sequences of RFHV or KSHV, than RFHV or KSHV are to any other virus 
listed in Table 1 . Preferred members of the family may be identified on the basis of the sequence of the 
Glycoprotein B gene in the region corresponding to that of Figure 1. Table 2 provides the degree of 
1 0 sequence identities in this region: 



| TABLE 2: Sequence Wentftles Between Glycoprotein B of KSHV and other Herpes Viruses 






Identity to polynucleotide fragment 


Glycoprotein B 
Sequence 


SEQ. ID NO: 


RFHV 
(SEaiDNO:1) 
Bases 363S4 


KSHV 
(SECLIDNO:3) 
Bases 36-364 


RFHV 




(100%) 


76% 


RFHV/KSHV 
subfamily 








KSHV 




76% 


(100%) 


Other gamma sHVI 
herpesviruses 


5 


60% 


63% 


bHV4 


6 


60% 


63% 


eHV2 


7 


52% 


54% 


mHV68 


8 


56% 


54% ■ 


hEBV 


9 


<50% 


52% 


alpha and beta hCMV 
herpesviruses 


10 


<50% 


<50% 


hHVB 


11 


<50% 


<50% 


hVZV 


12 


<50% 


<50% 


1 HSV1 


13 : I 


<50% 


<50% 



The percentage of sequence Identity is calculated by first aligning the encoded amino acid 
15 sequence, determining the corresponding alignment of the encoding polynucleotide, and then counting 
the number of residues shared between the sequences being compared at each aligned position. No 
penalty is imposed for the presence of insertions or deletions, but insertions or deletions are permitted 
only where required to accommodate an obviously increased number of amino acid residues in one of 
the sequences being aligned. Offsetting insertions just to improve sequence alignment are not 

AT 
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permitted at either the polypeptide or polynucleotide level. Thus, any insertions in the polynucleotide 
sequence will have a length which is a multiple of 3. The percentage is given in terms of residues in the 
test sequence that are identical to residues in the comparison or reference sequence. 

Preferred Glycoprotein B encoding polynucleotide sequences of this invention are those derived 
5 from the RFHV/KSHV herpes virus subfamily. TTiey include those sequences that are at least 65% 
identical with the RFHV or KSHV sequence between bases 36 and 354; more preferably, the 
sequences are at least 67% identical; more preferably, the sequences are at least about 70% identical; 
more preferably, the sequences are at least about 75% identical; more preferably, the sequences are at 
least about 80% identical; more preferably, the sequences are at least about 85% identical; more 

10 preferably, the sequences are at least about 90% identical; even more preferably, the sequences are 
over 95% identical. Also included are Glycoprotein B encoding regions that are upstream or 
downstream of a region fulfilling the identity criteria indicated. 

Other preferred Glycoprotein B encoding polynucleotide sequences may be identified by the 
percent identity with RFHV/KSHV subfamily-specific oligonucleotides (Type 2 oligonucleotides), 

15 described in more detail in a later section. The percent identity of RFHV and KSHV Glycoprotein B with 
exemplary Type 2 oligonucleotides is shown in Table 3: 



TABLE 3: Sequence Identities Between Glycoprotein B of Select Herpes Viruses and 
. RFHV/KSHV Subfamily Specific Oligonucleotides 



Glycoprotein B 
Sequence 


SEQ. ID 
NO: 


Identity to 
SHMDA 

(SEQ. ID 
NO:41) 


Identity to 
CFSSB 

(SEQ. ID 
NO-.43) 


Identity to 
ENTFA 

(SEQ. ID 
NO:45) 


Identity to 
DN1QB 

(SEQ. ID 
NO*6) 


RFHV 


1 


91% 


91% 


89% 


91% 


KSHV 


3 


100% 


85% 


89% 


97% 


SHV1 


5 


71% 


70% 


66% 


66% 


bHV4 


6 


57% 


64% 


69% 


74% 


eHV2 


7 


57% 


61% 


54% 


60% 


mHVBB 


8 


<50% 


55% 


54% 


77% 


hEBV 


9 


57% 


55% 


60% 


51% 


hCMV 


10 


57% 


55% 


60% 


51% 


hHV6 


11: 


<50% 


52% 


60% 


57% 


hVZV 




54% 


58% 


66% 


57% 


HSV1 


13 


57% 


60% 


54% 


54% 



Percent identity is calculated for oligonucleotides of this length by not allowing gaps in either the 
20 oligonucleotide or the polypeptide for purposes of alignment. Throughout this disclosure, whenever at 
least one of two sequences being compared is a degenerate oligonucleotide comprising an ambiguous 
residue, the two sequences are identical if at least one of the alternative forms of the degenerate 
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oligonucleotide is identical to the sequence with which it is being Compared. As an illustration, AY AAA 
is 100% identical to ATAAA, since AYAAA is a mixture of ATAAA and ACAAA. 

Preferred Glycoprotein B encoding sequences are those which over the corresponding region 
are at least 72% identical to SHMDA; more preferably they are at least 74% identical; more preferably 
5 they are at least about 77% identical; more preferably they are at least about 80% identical; more 
preferably they are at least about 85% identical; more preferably they are at least about 91% identical. 
Other preferred Glycoprotein B encoding sequences are those which over the corresponding region are 
at least 71% identical to CFSSB; more preferably they are at least 73% identical; more preferably they 
are at least about 77% identical; more preferably they are at least about 80% identical; more preferably 

10 they are at least about 85% identical. Other preferred Glycoprotein B encoding sequences are those 
which over the corresponding region are at least 70% identical to ENTFA; more preferably they are at 
least 72% identical; more preferably they are at least about 75% identical; more preferably they are at 
least about 80% identical; more preferably they are at least about 85% identical; even more preferably, 
they are at least about 89% identical. Other preferred Glycoprotein B encoding sequences are those 

15 which over the corresponding region are at least about 78% identical to DNIQB; more preferably they 
are at least 80% identical; more preferably they are at least about 85% identical; more preferably they 
are at least about 91% identical. Also included are Glycoprotein B encoding regions that are upstream 
or downstream of a region fulfilling the identity criteria indicated. 

Glycoprotein B encoding sequences from members of the RFHV/KSHV subfamily identified by 

20 any of the aforementioned sequence comparisons, using either RFHV or KSHV sequences, or the 
subfamily-specific oligonucleotides, are equally preferred. Exemplary sequences are the Glycoprotein B 
encoding sequences of RFHV and KSHV. Also embodied in this invention are fragments of any 
Glycoprotein B encoding sequences of the subfamily, and longer polynucleotides comprising such 
polynucleotide fragments. 

25 The polynucleotide sequences described in this section provide a basis for obtaining the 

synthetic oligonucleotides, proteins and antibodies outlined in the sections that follow. These 
compounds may be prepared by standard techniques known to a practitioner of ordinary skill in the art, 
and may be used for a number of investigative, diagnostic, and therapeutic purposes, as described 
below. 

30 

Preparation of polynucleotides 

Polynucleotides and oligonucleotides of this invention may be prepared by any suitable method 
known in the art For example, oligonucleotide primers can be used in a PGR amplification of DNA 
35 obtained from herpes virus infected tissue, as in Example 3 and Example 11, described below. 
Alternatively, oligonucleotides can be used to identify suitable bacterial clones of a DNA library, as 
described below in Example 8. 

Polynucleotides may also be prepared directly from the sequence provided herein by chemical 
synthesis. Several methods of synthesis are known in the art, including the triester method and the 
40 phosphite method. In a preferred method, polynucleotides are prepared by solid-phase synthesis using 
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mononucleoside phosphoramidite coupling units. See, for example Horise et al. a Beaucage et al.. Kumar 
et al., and U.S. Patent No. 4,415.732. 

A typical solid-phase synthesis involves reiterating four steps: deprotection, coupling, capping, and 
oxidation. This results in the stepwise synthesis of an oligonucleotide in the 3' to 5' direction. 
5 In the first step, the growing oligonucleotide, which is attached at the 3'-end via a (-O-) group to a 

solid support, is deprotected at the 5' end. For example, the 5' end may be protected by a -ODMT group, 
formed by reacting with 4,4'-dimethoxytrityl chloride (DMT-CI) in pyridine. This group is stable under basic 
conditions, but is easily removed under acid conditions, for example, in the presence of dichloroacetic acid 
(DCA) or trichloroacetic arid (TCA). Deprotection provides a 5* -OH reactive group. 
10 In the second step, the oligonucleotide is reacted with the desired nucleotide monomer, which itself 

has first been converted to a S'-protected, 3 -phosphoramidite. The 5* -OH of ttie monomer may be 
protected, for example, in the form of a -ODMT group, and the 3-OH group may be converted to a 
phosphoramidite, such as -OP(OR')NR 2 ; where R is the isopropyl group -CHfCHafc; and R* is, for example, 
-H (yielding a phosphoramidite diester), or -CH 3 , -CH 2 CH 3 , or the beta-cyanoethyl group -CH 2 CH 2 CN 
1 5 (yielding a phosphoramidite triester). The 3'-phosphoramidite group of the monomer reacts with the 5* -OH 
group of the growing oligonucleotide to yield the phosphite linkage 5'-OP(OR*)0-3\ 

In the third step, oligonucleotides that have not coupled with the monomer are withdrawn from 
further synthesis to prevent the formation of incomplete polymers. This is achieved by capping the 
remaining 5' -OH groups, for example, in the form ctf acetates (-OCfOJCH*) by reaction with acetic 
20 anhydride (CHjCJOJ-O-CPJCHa). 

In the fourth step, the newly formed phosphite group (i.e., 5'-OP(OR')0-3') is oxidized to a 
phosphate group (i.e., 5'-OP(=OXOR*)0-3'); for example, by reaction with aqueous iodine and pyridine. 

The four-step process may then be reiterated, since the oligonucleotide obtained at the end of the 
process is 5 a -protected and is ready for use in step one. When the desired futUength oligonucleotide has 
25 been obtained, it may be cleaved from the solid support, for example, by treatment with alkali and heat. 
This step may also serve to convert phosphate triesters (i.e., when R* is not -H) to the phosphate diesters 
(-0P<=0)20-), and to deprotect base-labile protected amino groups of the nucleotide bases. 

Polynucleotides prepared by any of these methods can be replicated to provide a larger supply by 
any standard technique, such as PGR amplification or gene cloning. 

30 

Cloning and expression vectors comprising a Glycoprotein B encoding polynucleotide 

Cloning vectors and expression vectors are provided in this invention that comprise a sequence 
encoding a herpes virus Glycoprotein B or variant or fragment thereof. Suitable cloning vectors may be 
35 constructed according to standard techniques, or may be selected from the large number of cloning 
vectors available in the art While the cloning vector selected may vary according to the host cell intended 
to be used, useful cloning vectors will generally have the ability to self-replicate, may possess a single 
target for a particular restriction endonudease, and may canry genes for a marker that can be used in 
selecting transfected clones. Suitable examples include plasmids and bacterial viruses; e.g., pUC18, 
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mp18, mp19, pBR322, pMB9. ColE1, pCR1. RP4, phage DNAs, and shuttle vectors like pSA3 and pAT28. 

Expression vectors generally are replicable polynucleotide constructs that encode a polypeptide 
operatively linked to suitable transcriptional and translational controlling elements. Examples of 
5 transcriptional controlling elements are promoters, enhancers, transcription initiation sites, and 
transcription termination sites. Examples of translational controlling elements are ribosome binding sites, 
translation initiation sites, and stop codons. Protein processing elements may also be included: for 
example, regions that encode leader or signal peptides and protease cleavage sites required for 
translocation of the polypeptide across the membrane or secretion from the cell. The elements employed 
1 0 would be functional in the host cell used for expression. The controlling elements may be derived from the 
same Glycoprotein B gene used in the vector, or they may be heterologous (i.e., derived from other genes 
and/or other organisms). 

Polynucleotides may be inserted into host cells by any means known in the art Suitable host cells 

include bacterial cells such as £. cofc mycobacteria, other prokaryotic microorganisms and eukaryotic cells 
1 5 (including fungal cells, insect cells, plant cells, and animal cells). The cells are transformed by inserting the 

exogenous polynucleotide by direct uptake, endocytosis, transfection, f-mating, or electroporation. 

Subsequently, the exogenous polynucleotide may be maintained within the cefl as a non-integrated vector, 

such as a plasmid, or may alternatively be integrated into the host cell genome. 

Cloning vectors may be used to obtain replicate copes of the polynucleotides they contain, or as a 
20 means of storing the polynucleotides in a depository for future recovery. Expression vectors and host cells 

may be used to obtain polypeptides transcribed by the polynucleotides they contain. They may also be 

used in assays where it is desirable to have intact cells capable of synthesizing the polypeptide, such as in 

a drug screening assay. 

25 Synthetic Type 1 oligonucleotides for Glycoprotein B of gamma herpes virus 

Oligonucleotides designed from sequences of herpes vims Glycoprotein B, as embodied in this 
invention, can be used as probes to identify related sequences, or as primers in an amplification reaction 
such as a PGR. 

30 Different oligonucleotides with different properties are described in the sections that follow. 

Oligonucleotides designated as Type 1 are designed from previously known gamma herpes virus 
Glycoprotein B polynucleotide sequences. They are designed to hybridize with polynucleotides encoding 
any gamma herpes virus Glycoprotein B. and may be used to detect previously known species of gamma 
herpes virus. They may also be used to detect and characterize new species of gamma herpes virus. 

35 Oligonucleotides designated as Type 2 are designed from the RFHV and KSHV Glycoprotein B 
polynucleotide sequences together. They are designed to hybridize with polynucleotides encoding 
Glycoprotein B of the RFHV/KSHV subfamily, including but not limited to RFHV and KSHV. 
Oligonucleotides designated as Type 3 are designed from RFHV or KSHV Glycoprotein sequences that 
are relatively unique to the individual virus. They are designed to hybridize specifically with 

40 polynucleotides encoding Glycoprotein B only from RFHV or KSHV and closely related viral strains. 
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Some preferred examples of Type 1 oligonucleotides are listed in Table 4. These oligonucleotides 
have a specificity for Glycoprotein B encoding polynucleotides of a broad range of herpes viruses. 



TABIX4: ty^ Vin* 

Poly nucleotides enabling Glycoprotein B 




Target Herpes C^rapro^ 8, especially fromgamro HerpesWuses 


.: : : ; De^g-r:' 

nation 


Sequence '■'}■: . ' 

:" : " ; -'iff--to :: ap* 


Length 


Naof 
: fonns 


Orien- 
tation 


SEQ 

ID; 




GCTGTTCAGA7TTGACTTAGAYMANMCNTGYCC 


33 


256 


sense 


13 


NIVPA 


GTGTACAAGAAGAACATCGTGCCNTAYATNTTYA 

A 
M 


32 


64 


sense 


14 


NIVPASQ 


GTGTACAAGAAGAACATCGTGCC 


23 


1 




15 


TVMCB 


AACATGTCTACAATCTCACARTTNACNGTNGT 


32 


128 


anti- 
sense 


16 




AAPATYVTPTAriAATPTPAPA 


ZU 


4 

1 




17 


FAYDA 


AATAACCTCTTTACGGCCCAAATTCARTWYGCN 


38 


64 


sense 


18 




CCAACGAGTGTGATGTCAGCCATTTAYGGNAAR 


38 


64 


sense 


19 


IYGKASQ 


CCAACfWSTGTGATGTCAGCC 


£ 1 


l 




ZU 


CYSRA 


TGCTACTCGCGACCTCTAGTCACCTTYAART7YR 
TNAA 


38 


64 


sense 


21 


CYSRASQ 


TGCTACTCGCGACCTCTAGTCACC 


24 


1 




22 


:NIDFB 


ACCGGAGTACAGTTCCACTGTYTTRAARTCDATR 
TT 


36 


48 


anti- 
sense 


23 


NIDFBSQ 


TGTCACCTTGACATGAGGCCA 


21 


1 




24 


FREYA 


TTTGACCTGGAGACTATGTTYMGNGARTAYAA 


32 


64 


sense 


25 


FREYB 


GCTCTGGGTGTAGTAGTTRTAYTCYCTRAACAT 


33 


16 


anti- 
sense 


26 


NVFDB 


TCTCGGAACATGCTCTCCAGRTCRAAMACRTT 


32 


32 


anti- 
sense 


27 


GGMA 


ACCTTCATCAAAAATCCCTTNGGNGGNATGYT 


32 


128 


sense 


28 


TVNCA 


TGGACTTACAGGACTCGAACNACNGTNAAYTG 


32 


128 


sense 


29 



The orientation indicated in Table 4 is relative to the encoding region of the polynucleotide. 
5 Oligomers with a "sense" orientation will hybridize to the strand antisense to the coding strand and 
initiate amplification in the direction of the coding sequence. Oligomers with an •antisense" orientation 
will hybridize to the coding strand and initiate amplification in the direction opposite to the coding 
sequence. 

These oligonucleotides have been designed with several properties in mind: 1) sensitivity for 
10 target DNA even when present in the source material at very low copy numbers; 2) sufficient specificity 
to avoid hybridizing with unwanted sequences; for example, host sequences with limited similarity; 3) 

30 
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sufficient cross-reactivity so that differences between an unknown target and the sequence used to 
design it do not prevent the oligonucleotide from forming a stable duplex with the target. 

For some applications, a particularly effective design is oligonucleotides that have a degenerate 
segment at the 3* end, designed from a region of at least 2 known polynucleotides believed to be 
5 somewhat conserved with the polynucleotide target. The various permutations of the ambiguous 
residues help ensure that at least one of the alternative forms of the oligonucleotide will be able to 
hybridize with the target. Adjacent to the degenerate segment at the 5' end of the oligonucleotide is a 
consensus segment which strengthens any duplex which may form and permits hybridization or 
amplification reactions to be done at higher temperatures. The degenerate segment is located at the 3' 
10 end of the molecule to increase the likelihood of a dose match between the oligonucleotide and the 
target at the site where elongation begins during a polymerase chain reaction. 

The ambiguous residues in the degenerate part of the sequences are indicated according to the 
following code: 



TABLE S: Single Letter Codes for Ambiguous Positions 


: ^Code:' : ': 


Represents 


R 


AorG (purine) » 


Y 


CorT (pyrimidine) 


W 


AorT 


S 


CorG 


M 


AorC 


K 


GorT 


B 


C or G or T (not A) 


D 


AorGorT (notC) 


H 


AorC or T (notG) 


V 


AorCorG (notT) 


N 


AorCorGorT 



15 

The Type 1 oligonucleotides shown in Table 4 are generally useful for hybridizing with 
Glycoprotein B encoding polynucleotide segments. This may be conducted to detect the presence of the 
polynucleotide, or to prime an amplification reaction so that the polynucleotide can be characterized 
further. Suitable targets include polynucleotides encoding a region of a Glycoprotein B from a wide 

20 spectrum of gamma herpes viruses, including members of the RFHV/KSHV subfamily. Suitable are those 
infecting any vertebrate animal, including humans and non-human primates, whether or not the 
Glycoprotein B or the virus has been previously known or described. Non-limiting examples include 
polynucleotides encoding Glycoprotein B from any of the gamma herpes viruses listed in Table 1 . 

The oligonucleotides may be used, inter alia, to prime a reaction to amplify a region of the target 

25 polynucleotide in the 3* direction from the site where the oligonucleotide hybridizes. FRFDA, HIVPA, 
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TVNCB, FAYDA, IYGKA, CYSRA, NIDFB, FREYA, FREYB, NVFDB, GGMA, and TVNCA are 
oligonucleotides with a consensus segment adjoining a degenerate segment and are useful for this 
purpose. 

Figure 2 shows the position along the Glycoprotein B polynucleotide sequence of the 
5 RFHV/KSHV subfamily where the aforementioned oligonucleotide primers are expected to hybridize. The 
map is not drawn to scale, but accurately depicts the order of the predicted hybridization sites in the 5' to 3' 
direction along the Glycoprotein B encoding strand. Also indicated are approximate lengths of 
amplification products that may be generated by using various sets of primers in an amplification reaction. 
The lengths shown include the primer binding sites at each end, and the polynucleotide encompassed 
10 between them. 

A preferred source of DNA for use as a target for the oligonucleotides of Table 4 is any biological 
sample (including solid tissue and tissue cultures), particularly of vertebrate animal origin, known or 
suspected to harbor a herpes virus. DNA is extracted from the source by any method known in the art. 
including extraction with organic solvents or precipitation at high salt concentration. 

15 A preferred method of amplification is a polymerase chain reaction: see generally U.S. Patent No. 

4,683,195 (Mullis) and U.S. 4,683,202 (Mullis et al.); see U.S. 5,176,995 (Sninsky et al.) for application to 
viral polynucleotides. An amplification reaction may be conducted by combining the target polynucleotide 
to be amplified with short oligonucleotides capable of hybridizing with the target and acting as a primer for 
the polymerization reaction. Also added are substrate mononucleotides and a heat-stable DNA-dependent 

20 Glycoprotein B, such as Taq. The conditions used for amplification reactions are generally known in the 
art, and can be optimized empirically using sources of known viruses, such RFHV, KSHV. hEBV or HSV1 . 
Conditions can be altered, for example, by changing the time and temperature of the amplification cycle, 
particularly the hybridization phase; changing the molarity of the oligonucleotide primers; changing the 
buffer composition; and changing the number of amplification cycles. Fine-tuning the amplification 

25 conditions is a routine matter for a practitioner of ordinary skill in the art. 

In one method, a single primer of this invention is used in the amplification, optionally using a 
second primer, such as a random primer, to initiate replication downstream from the first primer and in the 
opposite direction. In a preferred method, at least two of the primers of this invention are used in the same 
reaction to initiate replication in opposite directions. The use of at least two specific primers enhances the 

30 specificity of the amplification reaction, and defines the size of the fragment for comparison between 
samples. For example, amplification may be performed using primers NIVPA and TVNCB. More preferred 
is the use of several sets of primers in a nested fashion to enhance the amplification. Nesting is 
accomplished by performing a first amplification using primers that generate an intermediate product, 
comprising one or more internal binding sites for additional primers. This is followed by a second 

35 amplification, using a new primer in conjunction with one from the previous set or two new primers. The 
second amplification product is therefore a subfragment of the first product. If desired, additional rounds of 
amplification can be performed using additional primers. 

Accordingly, nesting amplification reactions can be performed with any combination of three or 
more oligonucleotide primers comprising at least one primer with a sense orientation and one primer with 
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an antisense mentation. Preferably, primers are chosen so that intermediate amplification products are no 
more than about 2000 base pairs; more preferably, they are no more than about 1500 base pairs; even 
more preferably, they are no more than about 750 base pairs. Preferably, the innermost primers provide a 
final amplification product of no more than about 1200 base pairs; more preferably, they are no more than 
5 about 750 base pairs; even more preferably, they are no more than about 500 base pairs. Accordingly, a 
preferred combination is at least three primers selected from FAYDA, IYGKA, CYSRA, NIDFB, NVFDB, 
and FREYB. Another preferred combination is at least three primers selected from FRFDA, NIVPA. 
TVNCA, NIDFB, NVFDB, and FREYB. 

Particularly preferred is a first amplification using primer FRFDA and TVNCB, followed by a 
10 second amplification using primer NIVPA and TVNCB. When performed on a polynucleotide from a 
Glycoprotein B gene of KSHV, the size of the final fragment including the primer binding regions is about 
386 bases. 

The amplified polynucleotides can be characterized at any stage during the amplification reaction, 
for example, by size determination. Preferably, this is performed by running the polynucleotide on a gel of 

15 about 1-2% agarose. If present in sufficient quantity, the polynucleotide in the gel can be stained with 
ethidium bromide and detected under ultraviolet light Alternatively, the polynucleotide can be labeled with 
a radioisotope such as M P or before loading on a gel of about 6% potyacrytamide, and the gel can 
subsequently be used to produce an autoradiogram. A preferred method of labeling the amplified 
polynucleotide is to end-label an oligonucleotide primer such as NIVPA with M P using a polynucleotide 

20 kinase and gamma-pP^ATP, and continuing amplification for about 5-1 5 cycles. 

If desired, size separation may also be used as a step in the preparation of the amplified 
polynucleotide. This is particularly useful when the amplification mixture is found to contain artifact 
polynucleotides of different size, such as may have arisen through cross-reactivity with undesired targets. 
A separating gel, such as described in the preceding paragraph, is dried onto a paper backing and used to 

25 produce an autoradiogram. Positions of the gel corresponding to the desired bands on the autoradiogram 
are cut out and extracted by standard techniques. The extracted polynucleotide can then be characterized 
directly, cloned, or used for a further round of amplification. 

The oligonucleotides NIVPASQ, TVNCBSQ, fYGKASQ, CYSRASQ, and NIDFBSQ are each 
derived from a consensus-degenerate Type 1 oligonucleotide. They retain the consensus segment, but 

30 lack the degenerate segment They are useful, inter alia, in sequencing of a Glycoprotein B polynucleotide 
fragment successfully amplified using a consensus-degenerate oligonucleotide. 

Unwanted polynucleotides in a mixture from an amplification reaction can also be proportionally 
reduced by shifting to primers of this type. For example, an initial 3-5 cycles of amplification can be 
conducted using primers NIVPA and TVNCB at 1/5 to 1/25 the normal amount Then a molar excess (for 

35 example, 50 pmoi) of NJVPASQ and/or TVNCBSQ is added, and the amplification is continued for an 
additional 30-35 cycles. This reduces the complexity of the oligonucleotides present in the amplification 
mixture, and permits the reaction temperatures to be increased to reduce amplification of unwanted 
polynucleotides. 
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Type 2 oligonucleotide primers for Glycoprotein B of the RFHV/KSHV subfamily 

Type 2 oligonucleotides are intended for detection or amplification reactions for the Glycoprotein B 
of any virus of the RFHV/KSHV subfamily. TTiey are designed from segments of the Glycoprotein B 
5 encoding region that are relatively well conserved between RFHV and KSHV, but not other previously 
sequenced gamma herpes viruses. Preferred examples are shown in Table 6: 



TABl£6rType2(Migm 


Target ^ 


De«lg- 
nation 


:: " : ^Seqtience ' " . 


Length : 


forms . 


Orten- 
taOon 


SEQ 

ID: 


SHMDA 


AGACCCGTGCCACTCTATGARATHAGYCAYA 
TGGA 


35 


24 


sense 


41 


SHMOASQ 


AGACCCGTGCCACTCTATGA 


20 


1 




42 


CFSSB 


GTTCACAACAATCTTCATNGARCTRAARCA 


30 


32 


anti- 
sense 


43 


CFSSBSQ 


GTTCACAACAATCTTCAT 


18 


1 




44 


ENTFA 


GTCAACGGAGTAGARAAYACNTTYACNGA 


29 


128 


sense 


45 


DNIQB 


ACTGGCTGGCTAAAGTACCTTTGAATRTTRTC 
NGT 


35 


16 


anti- 
sense 


46 


| DNIQBSQ 


ACTGGCTGGCTAAAGTACCTTTG 


23 


1 




47 



10 Type 2 oligonucleotides may be used for many purposes where specificity for the RFHV/KSHV 

subfamily specificity is desired. This includes the detection or amplification of Glycoprotein B from 
known viruses of the RFHV/KSHV subfamily, or characterization of Glycoprotein B from new members 
of the family. 

SHMDA, CFSSB, ENTFA, and DNIQB are consensus-degenerate oligonucleotides with a 
15 degenerate 3' end, useful as initial primers for PGR amplifications, including polynucleotides of the 
RFHV/KSHV subfamily which are not identical to either RFHV or KSHV. SHMDASQ, CFSSBSQ, and 
DNIQBSQ contain only a consensus segment, and are useful for example in labeling or sequencing 
polynucleotides already amplified using the consensus-degenerate oligonucleotides. 

In one application, these Type 2 oligonucleotides are used individually or in combination as 
20 amplification primers. In one example of this application, the oligonucleotides are used directly on DNA 
obtained from a tissue sample to obtain a Glycoprotein B from the RFHV/KSHV subfamily, but not more 
distantly related viruses that may be present in the same tissue, such as hEBV, hCMV or HSV1. Thus, 
SHMDA and DNIQB may be used as primers in a PCR, optionally preamplified using Type 1 
oligonucleotides such as NIVPA and TVNCB. Other combinations are also suitable. In another 
25 example, one of the Type 2 oligonucleotides of Table 6 is used in combination with a suitable Type 1 

3H 
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oligonucleotide listed earlier. Thus, NIVPA may be used in combination with DNIQB, or SHMDA may 
be used in combination with TVNCB as primers in a PCR. The DNA source may optionally be 
preamplified using NIVPA and TVNCB. Other combinations are also suitable. 

In another application, Type 2 oligonucleotides, or oligonucleotides comprising these sequences 
5 or fragments thereof, are used as probes in a detection assay. For example, they can be provided with 
a suitable label such as **P t and then used in a hybridization assay with a suitable target such as DNA 
amplified using FRFDA and/or NIVPA, along with TVNCB. 

Type 3 oligonucleotide primers specific for Glycoprotein B of RFHV or KSHV 

10 

Type 3 oligonucleotides are intended for detection or amplification reactions specific for a 
particular virus. They are non-degenerate segments of the Glycoprotein B encoding region of RFHV or 
KSHV that are relatively more variable between these two viruses and against other herpes viruses 
than are other segments of the region. Preferred examples are shown in Table 7, and in the Example 
15 section. 
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TABLE 7: Ty pe 3 OllgonucteoW^ Amplifying, or Characterizing 
Herpes Vfrue Polynucleotides encoding Glycoprotein B 


Target Glycoprotein B from RFHV 


Desig- 

nation 


: * Sequence 

■ V\ 
■ . :. \9 TO 3 ) 


Length 


No. of 
forme 


Orien-tation 


SEQ 
ID: 


bMltD 


1 ot» 1 oCTTCTGTCATACCGCG 


21 




anti-sense 


48 


aaitb 


TATTTGTTTGTGATTGCTGCT 


21 




anti-sense 


.49 


GMTEA 


GCGGTATGACAGAAGCAGCAA 


21 




sense 


60 


KYEIA 


AACAAATATGAGATCCCCAGG 


21 




sense 


51 


TDRDB 


TCATCCCGATCGGTGAACGTA 


21 




anti-sense 


52 


VEGLB 


TTGTCAGTTAGACCTTCGACG 


21 




anti-sense 


53 


VEGLA 


CCCGTCGAAGGTCTAACTGAC 


21 




sense 


54 


PVLYA 


AGCCAACCAGTACTGT ACTCT 


21 




sense 


55 


G^coprote/n B from KSHV 




Desig* 
nation 


'"Sequence 

(ytoa*) 


Length 


NO. Of 

forme 


Orientation 


SEQ 
ID: 


GLTEB 


TGATGGCGGACTCTGTCAAGC 


21 




anti-sense 


56 


TNKYB 


GTTCATACTTGTTGGTGATGG 


21 




anti-sense 


57 


GLTEA 


GGGCTTGACAGAGTCCGCCAT 


21 




sense 


58 


YELPA 


AGAAGTATGAACTCCCGAGAC 


21 




sense 


59 


VNVNB 


ACCCCGTTGACATTTACCTTC 


21 




anti-sense 


60 


TFTDV 


TCGTCTCTGTCAGTAAATGTG 


21 




anti-sense 


61 


TVFLA 


CCACAGTATTCCTCCAACCAG 


21 




sense 


62 


SQPVA 


GGTACT7TAGCCAGCCGGTCA 


21 




sense 


63 



GMTEB, AAfTB. GMTEA. KYEIA, TDRDB, VEGLB. VEGLA, and PVLYA are specific 
non-degenerate oligonucleotides for the RFHV Glycoprotein B, and can be used for the amplification or 
detection of Glycoprotein B encoding polynucleotides of RFHV origin. Amplification is preferably done 
using the oligonucleotides in a nested fashion: e.g., a first amplification is conducted using GMTEA and 
VEGLB as primers; then a second amplification is conducted using KYEIA and TDRDB as primers. 
This provides an extremely sensitive amplification assay that is specific for RFHV Glycoprotein B. 
GMTEB and AAITB hybridize near the 5* end of the fragment, and may be used in combination with up- 
stream hybridizing Type 1 oligonucleotides to amplify or detect sequences in the 5' direction. VEGLA 
and PVLYA hybridize near the 3' end of the fragment, and may be used in combination with down- 
stream hybridizing Type 1 oligonucleotides to amplify or detect sequences in the 3' direction. 

Similarly. GLTEB, TNKYB, GLTEA, YELPA, VNVNB, ENTFB, SQPVA. and TVFLA are specific 
non-degenerate oligonucleotides for the KSHV Glycoprotein B, and can be used in a similar fashion, 
including as primers for an amplification reaction. Preferably, the amplification is done using the 
oligonucleotides in a nested fashion: e.g., a first amplification is conducted using GLTEA and ENTFB 

3b 



WO 97/12042 



PCT/US96/15702 



as primers; then a second amplification is conducted using YELPA and VNVNB as primers. This 
provides an extremely sensitive amplification assay that is specific for KSHV Glycoprotein B. GLTEB 
and TNKYB hybridize near the 5* end of the fragment, and may be used in combination with up-stream 
hybridizing Type 1 oligonucleotides to amplify or detect sequences in the 5* direction. SQPVA and 
5 TVFLA hybridize near the 3* end of the fragment, and may be used in combination with down-stream 
hybridizing Type 1 oligonucleotides to amplify or detect sequences in the 3' direction. 

Practitioners skilled in the art will immediately recognize that oligonucleotides of Types 1, 2 and 
3 (in particular, those shown in Tables 4, 6 and 7) can be used in combination with each other in a PGR 
to amplify different sections of a Glycoprotein B encoding polynucleotide. The specificity of the 
10 amplification reaction generally is determined by the primer with the least amount of cross reactivity. 
The size and location of the amplified fragment is determined by the primers used in the final round of 
amplification. For example, NIVPA used in combination with SQPVB will amplify about 310 bases of 
Glycoprotein B encoding polynucleotide from a virus closely related to KSHV. Suitable combinations of 
oligonucleotides may be used as amplification primers in a nested fashion. 

15 

Use of synthetic oligonucleotides to characterize polynucleotide targets 

As described in the previous section, the oligonucleotides embodied in this invention, can be 
used as primers for amplification of polynucleotides encoding a herpes virus Glycoprotein B, particularly 

20 in a polymerase chain reaction. 

The conditions for conducting the PCR depend on the nature of the oligonucleotide being used. 
In particular, when using oligonucleotides comprising a degenerate segment, or a consensus segment 
that is only partly identical to the corresponding segment of the target, and when the target 
polynucleotide comprises an unknown sequence, the selection of conditions may be important to the 

25 success of the amplification. Optimizing conditions for a new primer or new polynucleotide target are 
routine for a practitioner of ordinary skill. What follows is a guide to assist in that objective. 

First, the temperature of the annealing step of the PCR is optimized to increase the amount of 
target polynucleotide being amplified above the amount of unrelated polynucleotide amplified. Ideally, 
the temperature permits the primers to hybridize with the target sequence but hot with other sequences. 

30 For primers comprising a consensus segment (Type 1 ), the temperature of the annealing step in repeat 
cycles of a PCR is generally at least about 45*C; preferably it is at least about 50*C. It is also preferable 
to conduct the first few cycles of the PCR at even higher temperatures, such as 55*C or even 60*C. 
The higher temperature will compel the annealing to be more sequence specific during the cycle and 
will reduce the background amplification of unrelated sequences. Annealing steps for subsequent 

35 cycles may be performed under slightly less stringent conditions to improve the rate of amplification. In 
an especially preferred procedure, the first PCR amplification cycle comprises an annealing step of 
about 1 min conducted at 60*C. Annealing steps in subsequent cycles are conducted at 2'C lower each 
cycle, until a temperature of 50*C is reached. Further cycles are then conducted with annealing steps 
at 50*C, until the desired degree of amplification is achieved. 
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Primers which are virus-specific and do not contain a consensus segment (Type 3) are more 
selective, and may be effective over a broader temperature range. Preferred temperatures for the 
annealing step in PCR amplification cycles are between 50"C and 65*C. 

Second, the buffer conditions are optimized. We have found that buffers supplied with 
5 commercial preparations of Taq polymerase are sometimes difficult to use, in part because of a critical 
dependence on the concentration of magnesium ion. PCRs performed using the oligonucleotides of 
this invention generally are more easily performed using a buffer such as that suggested by M. Wigler 
(Usitsyn et al.). Preferably, the final PCR reaction mixture contains (NH^O, instead of KCI as the 
principal ion source. Preferably, the concentration of (NH 4 )2S0 4 in the final reaction mixture is about 5- 

10 50 mM, more preferably about 10-30 mM, even more preferably 16 mM. The buffering component is 
preferably Tris, preferably at a final concentration of about 67 mM and a pH of about 8.8. Under these 
conditions, the MgCI 2 concentration is less critical. Preferably the final concentration is about 1-10 mM, 
more preferably it is about 3-6 mM, optimally it is about 4 mM. The reaction mixture may also contain 
about 10 mM O-mercaptoethanol and 0.05-1 mg/mL bovine serum albumin. An especially preferred 

15 buffer is WB4 buffer (67 mM Tris buffer pH 8.8, 4 mM MgCI 2 , 16 mM (NH^SO* 10 mM &- 
mercaptoethanol and 0.1 mg/mL albumin. Preferred conditions for performing the reaction are provided 
below in Example 3. 

To conduct the PCR reaction, a mixture comprising the oligonucleotide primers, the four 
deoxynucleotides, a suitable buffer, the DNA to be amplified, and a heat-stable ONA-dependent DNA 

20 polymerase is prepared. The mixture is then processed through temperature cycles for the annealing, 
elongating, and melting steps until the desired degree of amplification is achieved. The amount of DNA 
produced can be determined, for example, by staining with ethidium bromide, optionally after separating 
amplified fragments on an agarose gel. 

A possible complication of the amplification reaction is dimerization and amplification of the 

25 oligonucleotide primers themselves. This can be easily detected as low molecular weight (<100 base 
pair) fragments on an agarose gel. Amplified primer can be removed by agarose or polyacrylamide gel 
separation. The amount of amplified dimer may be reduced by minor adjustments to the conditions of 
the reaction, particularly the temperature of the annealing step. It is also preferable to pre-mix the 
primers, the deoxynucleotides, and the buffer, and heat the mixture to 80 degrees before adding the 

30 DNA to be amplified. 

Amplification reactions using any the oligonucleotides of this invention as primers yield 
polynucleotide fragments encoding a portion of a Glycoprotein B. These fragments can be 
characterized by a number of techniques known to a practitioner of ordinary skill in the art Some 
non-limiting methods for characterizing a fragment are as follows: 

35 In one method, a fragment may be sequenced according to any method of sequence 

determination known in the art, including the Maxam & Gilbert method, or the Sanger & Nicholson 
method. Alternatively, the fragment may be submitted to any of the commercial organizations that 
provide a polynucleotide sequencing service. The fragment may optionally be cloned and/or amplified 
before sequencing. The nucleotide sequence can be used to predict the amino acid sequence encoded 
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by the fragment. Sequence data can be used for comparison with other sequenced Glycoprotein B's, 
either at the polynucleotide level or the amino acid level, to identify the species of herpes virus present 
in the original source material. Sequence data can also be used in modeling algorithms to predict 
antigenic regions or three-dimensional structure. 
5 In a second method of characterizing, the size of the fragment can be determined by any suitable 

method, such as running on a polyacrylamide or agarose gel, or centrifuging through an appropriate 
density gradient For example, for RFHV and KSHV, the fragment between NfVPA and TVNCB is 
about 319 bases. Hence, the length of the entire amplified fragment including primer binding regions is 
about 386 bases. The corresponding fragment of sHV1 contains an additional 6 base pairs. The sHV1 

10 fragment can therefore be distinguished from that of RFHV or KSHV, for example, by running amplified 
polynucleotide fragments from each in neighboring lanes of a separating gel, or by running the sHV1 
fragment beside suitable molecular weight standards. Polynucleotide fragments identical in size to that 
of RFHV and KSHV may be from the same or a related viral species. Fragments substantially different 
in size are more likely to be derived from a different herpes virus. 

15 In a third method of characterizing, a fragment can be tested by attempting to hybridize it with an 

oligonucleotide probe. In a preferred example, a fragment is tested for relatedness to the Glycoprotein 
B encoding region of RFHV or KSHV. The test is conducted using a probe comprising a sequence of a 
Glycoprotein B encoding region, or its genetic complement. Suitable probes are polynucleotides 
comprising sequences from RFHV or KSHV, such as the Type 3 oligonucleotides listed in Table 7. 

20 The length and nature of the probe and the hybridization conditions are selected depending on 

the objectives of the test. If the objective is to detect only polynucleotides from RFHV or KSHV, 
Including minor strain variants, then hybridization is performed under conditions of high stringency. A 
sequence from the respective Glycoprotein B is used. Longer length sequences improve the specificity 
of the test and can be used under conditions of higher stringency. Preferably, the probe will comprise a 

25 Glycoprotein B sequence of at least about 30 nucleotides; more preferably, the sequence will be at 
least about 50 nucleotides; even more preferably, the sequence will be at least about 75 nucleotides in 
length. 

If the objective is to detect polynucleotides that are closely related but not identical to RFHV or 
KSHV, such as in a screening test or a test to recruit previously undescribed viruses of the RFHV/KSHV 

30 subfamily, then different conditions are chosen. Sequences from RFHV or KSHV may be used, but a 
mixture of the two or a degenerate probe is generally preferred. The length of the sequence and the 
conditions of the hybridization reaction are selected to provide sufficient specificity to exclude unwanted 
sequences, but otherwise provide a maximum degree of cross-reactivity amongst potential targets. 
Suitable conditions can be predicted using the formulas given earlier, by calculating the T m and then 

35 calculating the corresponding temperature for the maximum degree of mismatch to be tolerated. The 
suitability of the conditions can be tested empirically by testing the cross-reactivity of the probes with 
samples containing known target polynucleotides encoding herpes Glycoprotein B. 

The minimum degree of complementarity required for a stable duplex to form under the 
conditions of the assay will determine what Glycoprotein B sequences will hybridize with the probe. 



WO 97/12042 



PCI7US96/15702 



Consider, for example, a target obtained from a human or non-human primate, amplified to produce a 
fragment corresponding to bases 36-354 of SEQ. ID NO:3, and then probed with the corresponding 
fragment of the KSHV polynucleotide. According to the data in Table 2, if the hybridization reaction is 
performed under conditions that require only about 50% identity for a stable duplex to form, the probe 
5 may hybridize with targets from any of the sequenced gamma herpes Glycoprotein B genes, including 
hEBV and sHV1. If the reaction is performed under conditions that require at least about 65% identity 
between probe and target, preferably at least about 67% identity, more preferably at least about 70% 
identity, and even more preferably at least about 75% identity for a stable duplex to form, the assay will 
detect a target polynucleotide from the RFHV7KSHV subfamily; i.e., either RHFV, KSHV, or a closely 
10 related herpes virus with a Glycoprotein B polynucleotide not yet sequenced. Even under hybridization 
conditions that required only about 50-55% identity for a stable duplex to form, a positive reaction would 
not indicate the presence of bHV4, eHV2, or mHV68, since these viruses are not believed to be capable 
of infecting primates. 

It is possible to combine characterization by size and characterization by hybridization. For 
15 example, the amplified polynucleotide may be separated on a gel of acrylamide or agarose, blotted to a 
membrane of suitable material, such as nitrocellulose, and then hybridized with a probe with a suitable 
label, such as M P. The presence of the label after washing reflects the presence of hybridizable 
material in the sample, while the migration distance compared with appropriate molecular weight 
standards reflects the size of the material. A fragment sequence hybridizing with one of the 
20 aforementioned probes under conditions of high stringency but having an unexpected size would 
indicate a Glycoprotein B sequence with a high degree of identity to the probe, but distinct from either 
RFHV or KSHV. 

Use of polynucleotides and oligonucleotides to detect herpes virus Infection 

25 

Polynucleotides encoding herpes vims Glycoprotein B, and synthetic oligonucleotides based 
thereupon, as embodied in this invention, are useful in the diagnosis of clinical conditions associated 
with herpes virus infection. For example, the presence of detectable herpes Glycoprotein B in a clinical 
sample may suggest that the respective herpes virus participated as an etiologic agent in the 

30 development of the condition. The presence of viral Glycoprotein B in a particular tissue, but not in 
surrounding tissue, may be useful in the localization of an infected lesion. Differentiating between 
gamma herpes virus and other herpes viruses in clinical samples may be useful in predicting the clinical 
course of an infection or selecting a drug suitable for treatment. Since Glycoprotein B is expressed by 
replicative virus. L-particles, and infected cells, we predict that it will serve as a useful marker for active 

35 and quiescent stages of the disease that involve expression of the protein in any of these forms. 

The procedures for conducting diagnostic tests are extensively known in the art and are routine 
for a practitioner of ordinary skill. Generally, to perform a diagnostic method of this invention, one of the 
compositions of this invention is provided as a reagent to detect a target in a clinical sample with which 
it reacts. For example, a poiynudeotide of this invention may be used as a reagent to detect a DNA or 

40 RNA target such as might be present in a cell infected with a herpes vims. A polypeptide of this 
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invention may be used as a reagent to detect a target with which It is capable of forming a specific 
complex, such as an antibody molecule or (if the polypeptide is a receptor) the corresponding ligand. 
An antibody of this invention may be used as a reagent to detect a target it specifically recognizes, such 
as a polypeptide expressed by vtrally infected cells. 
5 The target is supplied by obtaining a suitable tissue sample from an individual for whom the 

diagnostic parameter is to be measured. Relevant test samples are those obtained from individuals 
suspected of harboring a herpes virus. Many types of samples are suitable for this purpose, including 
those that are obtained near the suspected site of infection or pathology by biopsy or surgical 
dissection, in vitro cultures of cells derived therefrom, solubilized extracts, blood, and blood 

10 components. If desired, the target may be partially purified from the sample or amplified before the 
assay is conducted. The reaction is performed by contacting the reagent with the sample under 
conditions that will allow a complex to form between the reagent and the target. The reaction may be 
performed in solution, or on a solid tissue sample, for example, using histology sections. The formation 
of the complex is detected by a number of techniques known in the art. For example, the reagent may 

15 be supplied with a label and unreacted reagent may be removed from the complex; the amount of 
remaining label thereby indicating the amount of complex formed. Further details and alternatives for 
complex detection are provided in the descriptions that follow. 

To determine whether the amount of complex formed is representative of herpes infected or 
uninfected cells, the assay result is preferably compared with a similar assay conducted on a control 

20 sample. It is generally preferable to use a control sample which is from an uninfected source, and 
otherwise similar in composition to the clinical sample being tested. However, any control sample may 
be suitable provided the relative amount of target in the control is known or can be used for comparative 
purposes. It is often preferable to conduct the assay on the test sample and the control sample 
simultaneously. However, if the amount of complex formed is quantifiable and sufficiently consistent, it 

25 is acceptable to assay the test sample and control sample on different days or in different laboratories. 

Accordingly, polynucleotides encoding Glycoprotein B of the RFHV/KSHV subfamily, and the 
synthetic oligonucleotides embodied in this invention, can be used to detect gamma herpes vims 
polynucleotide that may be present in a biological sample. General methods for using polynucleotides 
in specific diagnostic assays are well known in the art: see, e.g., Patent Application JP 5309000 

30 (latron). 

An assay employing a polynucleotide reagent may be rendered specific, for example: 1) by 
performing a hybridization reaction with a specific probe; 2) by performing an amplification with a 
specific primer, or 3) by a combination of the two. 

To perform an assay that is specific due to hybridization with a specific probe, a polynucleotide is 
35 chosen with the required degree of complementarity for the intended target Preferred probes include 
polynucleotides of at least about 16 nucleotides in length encoding a portion of the Glycoprotein B of 
RFHV, KSHV, or another member of the RFHV/KSHV subfamily. Increasingly preferred are probes 
comprising at least about 18, 21, 25, 30, 50, or 100 nucleotides of the Glycoprotein B encoding region. 
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Also preferred are degenerate probes capable of forming stable duplexes with polynucleotides of the 
RFHV/KSHV subfamily under the conditions used, but not polynucleotides of other herpes viruses. 

The probe is generally provided with a label. Some of the labels often used in this type of assay 
include radioisotopes such as and "P, chemiluminescent or fluorescent reagents such as 
5 fluorescein, and enzymes such as alkaline phosphatase that are capable of producing a colored solute 
or precipitant. The label may be intrinsic to the reagent, it may be attached by direct chemical linkage, 
or it may be connected through a series of intermediate reactive molecules, such as a biotin-avidin 
complex, or a series of inter-reactive polynucleotides. The label may be added to the reagent before 
hybridization with the target polynucleotide, or afterwards. To improve the sensitivity of the assay, it is 
10 often desirable to increase the signal ensuing from hybridization. This can be accomplished by using a 
combination of serially hybridizing polynucleotides or branched polynucleotides in such a way that 
multiple label components become incorporated into each complex. See U.S. Patent No. 5,124,246 
(Urdea et al.). 

If desired, the target polynucleotide may be extracted from the sample, and may also be partially 
15 purified. To measure viral particles, the preparation is preferably enriched for DNA; to measure active 
transcription of Glycoprotein B, the preparation is preferably enriched for RNA. Generally, it is 
anticipated that the level of polynucleotide of a herpes vims will be low in clinical samples: there may 
be just a few copies of DNA encoding the Glycoprotein B per cell where the vims is latent, or up to 
several hundred copies of DNA per cell where the virus is replicating. The level of mRNA will be higher 
20 in cells where the protein is actively expressed than those where the gene is inactive. It may therefore 
be desirable to enhance the level of target in the sample by amplifying the DNA or RNA. A suitable 
method of amplification is a PCR, which is preferably conducted using one or more of the 
oligonucleotide primers embodied in this invention. RNA may be amplified by making a cDNA copy 
using a reverse transcriptase, and then conducting a PCR using the aforementioned primers. 
25 The target polynucleotide can be optionally subjected to any combination of additional 

treatments, including digestion with restriction endonucteases, size separation, for example by 
electrophoresis in agarose or polyacrytamide, and affixation to a reaction matrix, such as a blotting 
material. 

Hybridization is allowed to occur by mixing the reagent polynucleotide with a sample suspected 
30 of containing a target polynucleotide under appropriate reaction conditions. This may be followed by 
washing or separation to remove unreacted reagent. Generally, both the target polynucleotide and the 
reagent must be at least partly equilibrated into the single-stranded form in order for complementary 
sequences to hybridize efficiently. Thus, it may be useful (particularly in tests for DNA) to prepare the 
sample by standard denaturation techniques known in the art. 
35 The level of stringency chosen for the hybridization conditions depends on the objective of the 

test. If it is desired that the test be specific for RFHV or KSHV, then a probe comprising a segment of 
the respective Glycoprotein B is used, and the reaction is conducted under conditions of high 
stringency. For example, a preferred set of conditions for use with a preferred probe of 50 nucleotides 
or more is 6 x SSC at 37*C in 50% formamide, followed by a wash at low ionic strength. This will 
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generally require the target to be at least about 90% identical with the polynucleotide probe for a stable 
duplex to form. The specificity of the reaction for the particular virus in question can also be increased 
by increasing the length of the probe used. Thus, longer probes are particularly preferred for this 
application of the invention. Alternatively, if it is desired that the test also be able to detect other herpes 
5 viruses related to KSHV, then a lower stringency is used. Suitable probes include fragments from the 
KSHV Glycoprotein B polynucleotide, a mixture thereof, or oligonucleotides such as those listed in 
Table 7. 

Appropriate hybridization conditions are determined to permit hybridization of the probe only to 
Glycoprotein B sequences that have the desired degree of identity with the probe. The stringency 

10 required depends on the length of the polynucleotide probe, and the degree of identity between the 
probe and the desired target sequence. Consider, for example, a probe consisting of the KSHV 
polynucleotide fragment between the hybridization sites of NIVPA and 7VNCB. Conditions requiring a 
minimum identity of 60% would result in a stable duplex formed with a corresponding polynucleotide of 
KSHV and other gamma herpes viruses such as sHVt; conditions requiring a minimum identity of 90% 

15 would result in a stable duplex forming only with a polynucleotide from KSHV and closely related 
variants. Conditions of intermediate stringency requiring a minimum identity of 65-70% would permit 
duplexes to form with a Glycoprotein B polynucleotide of KSHV, and some other members of the 
RFHV/KSHV subfamily, but not with corresponding polynucleotides of other known herpes viruses, 
including gamma herpes viruses eHV2, sHV1, mHV66, bHV4, EBV, and other human pathogens such 

20 as hCMV, hHV6, hVZV, and HSV1 . 

Conditions can be estimated beforehand using the formula given earlier. Preferably, the exact 
conditions are confirmed by testing the probe with separate samples known to contain polynucleotides, 
both those desired to be detected and those desired to go undetected in the assay. Such samples may 
be provided either by synthesizing the polynucleotides from published sequences, or by extracting and 

25 amplifying DNA from tissues believed to be infected with the respective herpes vims. Determining 
hybridization conditions is a matter of routine adjustment for a practitioner of ordinary skill, and does not 
require undue experimentation. Since eHV2, sHV1, mHV68, bHV4 and EBV are more closely identical 
to the RFHV/KSHV subfamily than alpha and beta herpes viruses, conditions that exclude gamma 
herpes viruses outside the RFHV/KSHV subfamily will generally also exclude the other herpes viruses 

30 listed in Table 1 . In addition, if it is believed that certain viruses will not be present in the sample to be 
tested in the ultimate determination (such as eHV2, mHV68 or bHV4 in a human tissue sample), then 
the corresponding target sequences may optionally be omitted when working out the conditions of the 
assay. Thus, conditions can be determined that would permit Type 2 oligonucleotide probes such as 
those listed in Table 6 to form a stable duplex both with polypeptides comprising SEQ. ID NO:1 or SEQ. 

35 ID NO:3, but not a sequence selected from the group consisting of SEQ. ID NO:5-13. Conditions can 
also be determined that would permit a suitable fragment comprising at least 21 or more consecutive 
bases of SEQ. ID NO:1 or SEQ. ID NO:3 to form a stable duplex both with a polynucleotide comprising 
SEQ. ID NO:1 and SEQ. ID NO:3, but not a polynucleotide comprising any one of SEQ. ID NO:5-13. 
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Alternatively, to conduct an assay that is specific due to amplification with a specific primer: 
DNA or RNA is prepared from the biological sample as before. Optionally, the target polynucleotide is 
pre-amplrfied in a PCR using primers which are not species specific, such as those listed in Table 4 or 
6. The target is then amplified using specific primers, such as those listed in Table 7, or a combination 
5 of primers from Table 4, 6, and 7. In a preferred embodiment, two rounds of amplification are 
performed, using oligonucleotide primers in a nested fashion: virus-specific or non-specific in the first 
round; virus-specific in the second round. This provides an assay which is both sensitive and specific 

Use of a specific Type 3 primer during amplification is sufficient to provide the required 
specificity. A positive test may be indicated by the presence of sufficient reaction product at the end of 
1 0 the amplification series. Amplified polynucleotide can be detected, for example, by blotting the reaction 
mixture onto a medium such as nitrocellulose and staining with ethidium bromide. Alternatively, a 
radiolabeled substrate may be added to the mixture during a final amplification cycle; the incorporated 
label may be separated from unincorporated label (e.g., by blotting or by size separation), and the label 
may be detected (e.g. by counting or by autoradiography). If run on a gel of agarose or polyacrylamide. 
15 the size of the product may help confirm the identity of the amplified fragment. Specific amplification 
can also be followed by specific hybridization, by using the amplification mixture obtained from the 
foregoing procedure as a target source for the hybridization reaction outlined earlier. 

U$eof polynucleotides for gene therapy 

20 

Embodied in this invention are pharmaceuticals comprising virus-specific polynucleotides, 
polypeptides, or antibodies as an active ingredient. Such compositions may decrease the pathology of 
the virus or infected cells on their own. or render the virus or infected cells more susceptible to 
treatment by non-specific pharmaceutical compounds. 

25 Polynucleotides of this invention encoding part of a herpes virus Glycoprotein B may be used, for 

example, for administration to an infected individual for purposes of gene therapy (see generally U.S. 
Patent No. 5,399,346: Anderson et al.). The general principle is to administer the polynucleotide in 
such a way that it ether promotes or attenuates the expression of the polypeptide encoded therein. 

A preferred mode of gene therapy is to provide the polynucleotide in such a way that it will be 

30 replicated inside the cell, enhancing and prolonging the effect. Thus, the polynucleotide is operatively 
linked to a suitable promoter, such as the natural promoter of the corresponding gene, a heterologous 
promoter that is intrinsically active in cells of the target tissue type, or a heterologous promoter that can 
be induced by a suitable agent. Entry of the polynucleotide into the cell is facilitated by suitable 
techniques known in the art. such as providing the polynucleotide in the form of a suitable vector, such 

35 as a viral expression vector, or encapsulation of the polynucleotide in a liposome. The polynucleotide 
may be injected systemically, or provided to the site of infection by an antigen-specific homing 
mechanism, or by direct injection. 

In one variation, the polynucleotide comprises a promoter linked to the polynucleotide strand with 
the same orientation as the strand that is transcribed during the course of a herpes virus infection. 

40 Preferably, the Glycoprotein B that is encoded includes an external component, a transmembrane 
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component and signal sequences for transport to the surface. Virally infected cells transfected with 
polynucleotides of this kind are expected to express an enhanced level of Glycoprotein B at the surface. 
Enhancing Glycoprotein B expression in this fashion may enhance recognition of these cells by 
elements of the immune system, including antibody (and antibody-dependent effectors such as ADCC), 
5 and virus-specific cytotoxic T cells. 

In another variation, the polynucleotide comprises a promoter linked to the polynucleotide strand 
with the opposite orientation as the strand that is transcribed during the course of a herpes vims 
infection. Virally infected cells transfected with polynucleotides of this kind are expected to express a 
decreased level of Glycoprotein B. The transcript is expected to hybridize with the complementary 
10 strand transcribed by the viral gene, and prevent it from being translated. This approach is known as 
anti-sense therapy. 

RFHV/KSHV subfamily polypeptides with Glycoprotein B activity and fragments thereof 

15 The RFHV and KSHV polynucleotide sequences shown in Figure 1 have open reading frames. 

The polypeptide encoded thereby are shown in SEQ. ID NO:2 and SEQ. ID NO:4, respectively. 
Encoded between the hybridizing regions of the primers NIVPA and TVNCB used to obtain the 
polynucleotide sequence is a 106 amino acid fragment of the Glycoprotein B molecule which is 91% 
identical between RFHV and KSHV. The full protein sequence of KSHV Glycoprotein B is shown in 

20 SEQ. ID NO:94. A Glycoprotein B fragment of a third member of the RFHV/KSHV subfamily, RFHV2, is 
shown in SEQ. ID NO:97. 

There are a number of homologous residues to Glycoprotein B molecules of other sequenced 
herpes viruses. The longest sequence contained in SEQ. ID NO:2 or SEQ. ID NO:4 but not in the 
known sequences of other herpes viruses is 9 amino acids in length, with two exceptions (SEQ. ID 

25 NOS:64 and 65). Longer matching portions are found elsewhere in the Glycoprotein B amino acid 
sequence. The longest is the 21 amino acid sequence from bHV4 shown in SEQ. ID NO:99; the rest 
are all 16 amino acids long or less. Other than SEQ. ID NO:99 exception, any fragment of the RFHV 
and KSHV Glycoprotein B protein sequence that is 17 amino adds or longer is believed to be specific 
for RFHV or KSHV. respectively, or to closely related strains. Since bHV4 and the other viruses with 

30 matching segments are not believed to be capable of infecting primates, any fragment of about 10 
amino acids or more found in a primate that was contained in SEQ. ID NO:4 would indicate the 
presence of an infectious agent closely related to KSHV. 

This invention embodies both intact Glycoprotein B from herpes viruses of the RFHV/KSHV 
subfamily, and any fragment thereof that is specific for the subfamily. Preferred Glycoprotein B 

35 fragments of this invention are at least 10 amino acids in length; more preferably they are at least 13 
amino acids in length; more preferably they are at least 17 amino acids in length; more preferably they 
are at least about 20 amino adds in length; even more preferably they are at least about 25 amino 
adds in length, still more preferably they are at least about 30 amino adds in length. 

The amino add sequence of the RFHV and KSHV Glycoprotein B fragment shown in SEQ. ID 

40 NOS:2 ( 4, 94 and 96 can be used to identify vims-specific and cross-reactive antigenic regions. 
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In principle, a specific antibody could recognize any amino acid difference between sequences 
that is not also shared by the species from which the antibody is derived. Antibody binding sites are 
generally big enough to encompass 5-9 amino acid residues of an antigen, and are quite capable of 
recognizing a single amino acid difference. Specific antibodies may be part of a polyclonal response 
5 arising spontaneously in animals infected with a virus expressing the Glycoprotein B. Specific 
antibodies may also be induced by injecting an experimental animal with either the intact Glycoprotein 
B or a Glycoprotein B fragment 

Thus, any peptide of 5 amino acids or more that is unique to KSHV is a potential virus-specific 
antigen, and could be recognized by a KSHV-spectfic antibody. Similarly, any peptide of sufficient 
10 length shared within the RFHV/KSHV subfamily but not with other herpes viruses is a potential 
subfamily-specific antigen. 

Some examples of preferred peptides are shown in Table 8. Practitioners in the art will 
immediately recognize that other peptides with similar specificities may be designed by minor 
alterations to the length of the peptides listed and/or moving the frame of the peptide a few residues in 
15 either direction. 

The Class I peptides shown in Table 8 are conserved between Glycoprotein B of KSHV and that 
of certain other members of the gamma herpes virus subfamily. An antibody directed against one such 
Glycoprotein B in this region may therefore cross-react with some of the others. Class II peptides are 
conserved between Glycoprotein B of RFHV and KSHV, but not with other gamma herpes viruses. An 

20 antibody directed against this region is expected to cross-react between RFHV, KSHV, and other 
viruses of the RFHV/KSHV subfamily; but not with herpes viruses outside the subfamily. Class III 
peptides are different between Glycoprotein B of RFHV. KSHV, and other known gamma herpes 
viruses. An antibody binding to this region, particularly to non-identical residues contained therein, is 
expected to distinguish RFHV and KSHV Glycoprotein B from each other, and from Glycoprotein B of 

25 more distantly related herpes viruses. 
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TABLE 8: Antigen Peptides 


Spe^fldHy • 


Sequence 


Length 


5EQ. ID 
NO: 


Class 1: : V>" 


Shared with 










bHV4 : 


YRKIATSVTVYRG 


13 


64 






RYFSQP 


6 


66 


Shared amongst RFHV/KSHV 
■ subfamily and some other 
gamma herpes viruses ; f |i 




IYAEPGWFPGIYRVR 
IYAEPGWFPGIYRVRT7VNCE 


15 
21 


65 
99 




rnHVB8 


VLEELSRAVVCREQVRD 


16 


100 


' Class II: 




VTVYRG 


6 


67 






AITNKYE 


7 


68 


Shared amongst ^HVTI^HViq 
subfdinity ■ 




onMUd 1 Y 


7 


69 






(^ntftdl) 


7 


70 






TVFLQPV 


7 


71 






TDNIQRY 


7 


72 


Class Ql: 


Specific for 












RGMTEAA 


7 


73 


Virus specific : ; 


kshv 


RGLTESA 


7 


75 




: RFHV 


PVLYSEP 


7 


74 




KSHV 


PVIYAEP 


7 


76 


1 - Not shared with arty otr^ 

present in some unsequenced RFHV/KSHV subfamily viruses 



Particularly preferred peptides from Class 111 are those encompassing regions of Glycoprotein B 
with the polarity characteristics appropriate for an antigen epitope, as described In the Example section. 
5 Given the complete sequence of a Glycoprotein B from KSHV and other members of the RFHV/KSHV 
subfamily, virus- or subfamily-specific peptides can be predicted for other regions of the molecule by a 
similar analysis. 

Preparation of polypeptides 

10 

Polypeptides of this invention may be prepared by several different methods, all of which will be 
known to a practitioner of ordinary skill. 

For example, short polypeptides of about 5-50 amino acids in length are conveniently prepared 
from sequence data by chemical synthesis. A preferred method is the solid-phase Merrifield technique. 
15 Alternatively, a messenger RNA encoding the desired polypeptide may be isolated or synthesized 
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according to one of the methods described earlier, and translated using an in vitro translation system, 
such as the rabbit reticulocyte system. See, e.g., Dorsky et al. 

Longer polypeptides, up to and including the entire Glycoprotein B, are conveniently prepared 
using a suitable expression system. For example, the encoding strand of a full-length cDNA can be 
5 operatively linked to a suitable promoter, inserted into an expression vector, and transfected into a 
suitable host cell. The host cell is then cultured under conditions that allow transcription and translation 
to occur, and the polypeptide is subsequently recovered. For examples of the expression and recovery 
of Glycoprotein B from other species of herpes virus, see. for example, U.S. Patent Nos. 4,642,333 
(Person); 5,244,792 (Burke et al.); Manservigi et al. 

10 For many purposes, it is particularly convenient to use a recombinant Glycoprotein B 

polynucleotide that includes the regions encoding signals for transport to the cell surface, but lacks the 
region encoding the transmembrane domain of the protein. The polynucleotide may be truncated 5' to 
the transmembrane encoding region, or it may comprise both extracellular and cytoplasmic encoding 
region but lack the transmembrane region. Constructs of this nature are expected to be secreted from 

15 the cell in a soluble form. Where It is desirable to have a Glycoprotein B fragment that is a monomer, 
the recombinant may be designed to limit translation to about the first 475 amino acids of the protein. 

For example, to express any of these forms of Glycoprotein B in yeast, a cassette may be 
prepared using the glyceraldehyde-3-phosphate-dehydrogenase (GAPDH) promoter region and 
terminator region. GAPDH gene fragments are identified in a yeast library, isolated and ligated in the 

20 appropriate configuration. The cassette is cloned into pBR322, isolated and confirmed by DNA 
sequencing. A pCI/t plasmid is constructed containing a Glycoprotein B insert and GAPDH promoter 
and terminator regions. The plasmid is used to transform yeast strain S. cemvisiae. After culture, the 
yeast cells are pelleted by centrifugatton, resuspended in a buffer containing protease inhibitors such as 
1 mM phenylmethylsutfbnyl fluoride and 0.1 ng/ml pepstatin. The washed cells are disrupted by 

25 vortexing with glass beads and recentrifuged. The presence in the supernatant of a Glycoprotein B of 
the correct size may be confirmed, for example, by Western blot using an antibody against Glycoprotein 
B, prepared as described in a following section. Glycoprotein B may be purified from the supernatant 
by a combination of standard protein chemistry techniques, including ion exchange chromatography, 
affinity chromatography using antibody or substrate, and high-pressure liquid chromatography. 

30 To express Glycoprotein B in mammalian cells, for example, a mammalian expression vector 

such as pSV1/dhfr may be used. This has an ampicillirvresistance beta-lactamase gene, and a 
selectable mammalian cell marker, dihydrofolate reductase linked to the SV40 early promoter. 
Glycoprotein B polynucleotide blunt-end fragments are ligated into the pSV1/dhfr vector and digested 
with endonucleases to provide a cassette Including the SV40 promoter, the Glycoprotein encoding 

35 region, and the SV40 splice and polyadenylation sites. The plasmids are used, for example, to 
transform CHO cells deficient in dhfr, and transfectants are selected. Cells expressing Glycoprotein B 
may be identified, for example, by immunofluorescence using anti-Glycoprotein B as the primary 
antibody. 
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In another example, recombinant piasmids for expressing Glycoprotein B are cloned under 
control of the Rous sarcoma virus long terminal repeat in the episomal replicating vector pRP-RSV. 
This plasmid contains the origin of replication and early region of the human papovavirus BK, as well as 
the dhfr resistance marker. The vector is can then be used, for example, to transform human 293 cells. 
5 By using a Glycoprotein B encoding region devoid of the transmembrane spanning domain, the 
Glycoprotein B polypeptide is constitutively secreted into the culture medium at 0.15-0.25 pg/cell/day. 
In the presence of 0.6-6 jiM methotrexate, production may be increased 10- to 100-fold, because of an 
amplification of the episomal recombinant. Glycoprotein B prepared in this way are appropriate, inter 
alia, for use in diagnosis, and to prepare vaccines protective against new and recurrent herpes virus 
1 0 infections (Manservigi et al). 

Use of polypeptides to assess herpes virus Infection 

The polypeptides embodied in this invention may be used to detect or assess the status of a 
15 herpes virus infection in an individual in several different applications. 

In one application, a polypeptide encoding a portion of a herpes virus Glycoprotein B is supplied 
as a reagent for an assay to detect the presence of antibodies that can specifically recognize it. Such 
antibodies may be present, for example, in the circulation of an individual with current or past herpes 
virus exposure. 

20 The presence of antibodies to Glycoprotein B in the circulation may provide a sensitive and early 

indication of viral infection. Since Glycoprotein B is a functional component of the viral envelope, it is 
produced in greater quantity than other transcripts sequestered within the viral particle. Its distribution 
is wider than transcripts that appear only transiently in the life cycle of the virus. Furthermore, it may be 
expressed not only by intact virus, but also by non-infective products of virally infective cells, such as L- 

25 particles. Glycoprotein B from various species of herpes virus are known to be strongly immunogenic. 
Thus, detection of antibody to Glycoprotein B in an individual may be an indication of ongoing active 
herpes virus infection, latent infection, previous exposure, or treatment with a Glycoprotein B vaccine. 

Suitable clinical samples in which to measure antibody levels include serum or plasma from an 
individual suspected of having a gamma herpes virus infection. The presence of the antibody is 

30 determined, for example, by an immunoassay. 

A number of immunoassay methods are established in the art for performing the quantitation of 
antibody using viral peptides (see, e.g., U.S. Patent No. 5,350,671: Houghton et al.). For example, the 
test sample potentially containing the specific antibody may be mixed with a pre-determined 
non-limiting amount of the reagent polypeptide. The reagent may contain a directly attached label, such 

35 as an enzyme or a radioisotope. For a liquid-phase assay, unreacted reagents are removed by a 
separation technique, such as filtration or chromatography. Alternatively, the antibody in the sample 
may be first captured by a reagent on a solid phase. This may be, for example, the specific 
polypeptide, an antHmmunoglobulin, or Protein A. The captured antibody is then detected with a 
second reagent, such as the specific polypeptide, antHmmunoglobulin, or protein A with an attached 

40 label. At least one of the capture reagent or the detecting reagent must be the specific polypeptide. In 
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a third variation, cells or tissue sections containing the polypeptide may be overlaid first with the test 
sample containing the antibody, and then with a detecting reagent such as labeled antiimmunoglobulin. 
In all these examples, the amount of label captured in the complex is positively related to the amount of 
specific antibody present in the test sample. Similar assays can be designed in which antibody in the 
5 test sample competes with labeled antibody for binding to a limiting amount of the specific peptide. The 
amount of label in the complex is then negatively correlated with the amount of specific antibody in the 
test sample. Results obtained using any of these assays are compared between test samples, and 
control samples from an uninfected source. 

By selecting the reagent polypeptide appropriately, antibodies of a desired specificity may be 

10 detected. For example, if the intact Glycoprotein B is used, or a fragment comprising regions that are 
conserved between herpes virus, then antibodies detected in the test samples may be virus specific, 
cross-reactive, or both. A multi-epitope reagent is preferred for a general screening assay for 
antibodies related to herpes virus infection. To render the assay specific for antibodies directed either 
against RFHV or against KSHV, antigen peptides comprising non-conserved regions of the appropriate 

15 Glycoprotein B molecule are selected, such as those listed in Class III of Table 8. Preferably, a mixture 
of such peptides is used. To simultaneously detect antibodies against RFHV. KSHV, and closely 
related viruses of the gamma herpes family, but not sHV1 and EBV, antigen peptides are selected with 
the properties of those listed in Class II of Table 8. Preferably, a mixture of such peptides is used. 

Antibodies stimulated during a herpes virus infection may subside once the infection resolves, or 

20 they may persist as part of the immunological memory of the host. In the latter instance, antibodies due 
to current infection may be distinguished from antibodies due to immunological memory by determining 
the class of the antibody. For example, an assay may be conducted in which antibody in the test 
sample is captured with the specific polypeptide, and then developed with labeled antMgM or antMgG. 
The presence of specific antibody in the test sample of the IgM class indicates ongoing infection, white 

25 the presence of IgG antibodies alone indicates that the activity is due to immunological memory of a 
previous infection or vaccination. 



Use of polypeptides to design or screen anti-viral drugs 

Interfering with the Glycoprotein B gene or gene product would modify the infection process, or 
the progress of this disease. H is an objective of this invention to provide a method by which useful 
pharmaceutical compositions and methods of employing such compounds in the treatment of gamma 
herpes vims infection can be developed and tested. Particularly preferred are pharmaceutical 
compounds useful in treating infections by RFHV, KSHV and other members of the RFHV/KSHV 
subfamily. Suitable drugs are those that interfere with transcription or translation of the Glycoprotein B 
gene, and those that interfere with the biological function of the polypeptide encoded by the gene. It is 
not necessary that the mechanism of interference be known; only that the interference be preferential 
for reactions associated with the infectious process. 

Preferred drugs include those that competitively interfere with the binding of the Glycoprotein B 
to its substrate on target cells, such as heparan sulfate and its analogs. Also preferred are drugs that 

SO 
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competitively interfere with any interaction of Glycoprotein B to other viral envelope components that 
may be necessary for the virus to exert one of its biologic functions, such as penetration of target cells. 
Also preferred are molecules capable of cross-linking or otherwise immobilizing the Glycoprotein B, 
thereby preventing it from binding its substrate or performing any biological function that plays a role in 
5 viral inactivity. 

This invention provides methods for screening pharmaceutical candidates to determine which 
are suitable for dinical use. The methods may be brought to bear on antiviral compounds that are 
currently known, and those which may be designed in the future. 

The method involves combining an active Glycoprotein B with the pharmaceutical candidate, and 

10 determining whether the biochemical function is altered by the pharmaceutical candidate. The 
Glycoprotein B may be any fragment encoded by the Glycoprotein B gene of the RFHV/KSHV 
subfamily that has Glycoprotein B activity. Suitable fragments may be obtained by expressing a 
genetically engineered polypeptide encoding an active site of the molecule, or by cleaving the 
Glycoprotein B with proteases and purifying the active fragments. In a preferred embodiment, the entire 

15 Glycoprotein B is provided. The reaction mixture will also comprise other components necessary to 
measure the biological activity of the protein. For example, in an assay to measure substrate binding, 
heparan sulfate or an analog thereof may be provided, perhaps linked to a solid support to facilitate 
measurement of the binding reaction. 

One embodiment of the screening method is to measure binding of the pharmaceutical candidate 

20 directly to the isolated Glycoprotein B, or a fragment thereof. Compounds that bind to an active site of 
the molecule are expected to interfere with Glycoprotein B activity. Thus, the entire Glycoprotein B, or a 
fragment comprising the active site, is mixed with the pharmaceutical candidate. Binding of the 
candidate can be measured directly, for example, by providing the candidate in a radiolabeled or 
stable-isotope labeled form. The presence of label bound to the Glycoprotein B can be determined, for 

25 example, by precipitating the Glycoprotein B with a suitable antibody, or by providing the molecule 
attached to a solid phase, and washing the solid phase after the reaction. Binding of the candidate to 
the Glycoprotein B may also be observed as a conformational change, detected for example by 
difference spectroscopy, nuclear magnetic resonance, or circular dichroism. Alternatively, binding may 
be determined in a competitive assay: for example, Glycoprotein B is mixed with the candidate, and 

30 then labeled nucleotide or a fragment of a regulatory subunit is added later. Binding of the candidate to 
the biochemically relevant site should inhibit subsequent binding of the labeled compound. 

A second embodiment of the screening method is to measure the ability of the pharmaceutical 
candidate to inhibit the binding of Glycoprotein B to a substrate or substrate analog. A preferred analog 
is heparin, coupled a solid support such as Sepharose™ beads. Inhibition may be measured, for 

35 example, by providing a radiolabel to the Glycoprotein B. incubating it with the pharmaceutical 
candidate, adding the affinity resin, then washing and counting the resin to determine if the candidate 
has decreased the amount of radioactivity bound. Pharmaceutical candidates may also be tested for 
their ability to competitively interfere with interactions between Glycoprotein B and other herpes virus 
proteins. 
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A third embodiment of the screening method is to measure the ability of the pharmaceutical 
candidate to inhibit an activity of an active particle, such as a viral particle, mediated by Glycoprotein B. 
A particle is engineered to express Glycoprotein B t but not other components that are capable of 
mediating the same function. The ability of the particle to exhibit a biological function, such as 
5 substrate binding or membrane fusion, is then measured in the presence and absence of the 
pharmaceutical candidate by providing an appropriate target 

This invention also provides for the development of pharmaceuticals for the treatment of herpes 
infection by rational drug design. See, generally, Hodgson, and Erickson et al. In this embodiment, the 
three-dimensional structure of the Glycoprotein B is determined, either by predictive modeling based on 
10 the amino acid sequence, or preferably, by experimental determination, Experimental methods include 
antibody mapping, mutational analysis, and the formation of antiidiotypes. Especially preferred is X-ray 
crystallography. Knowing the three-dimensional structure of the glycoprotein, especially the orientation 
of important amino acid groups near the substrate binding site, a compound is designed de novo, or an 
existing compound is suitably modified. The designed compound will have an appropriate charge 
15 balance, hydrophobicity, and/or shape to permit it to attach near an active site of the Glycoprotein B, 
and sterically interfere with the normal biochemical function of that site. Preferably, compounds 
designed by this method are subsequently tested in a drug screening assay, such as those outlined 
above. 



20 Antibodies against Glycoprotein B and their preparation 

The amino acid sequence of the Glycoprotein B molecules embodied herein are foreign to the 
hosts they infect. Glycoprotein B from other species of herpes virus are known to be strongly 
immunogenic in mammals. Anti-Glycoprotein B is formed in humans, for example, as a usual 

25 consequence of infection with hCMV. By analogy, it is expected that Glycoprotein B of RFHV, KSHV, 
and other members of the RFHV/KSHV subfamily will be immunogenic in mammals, including humans. 
These expectations are supported by the observations described in the Example section below. 

Antibodies against a polypeptide are generally prepared by any method known in the art To 
stimulate antibody production in an animal experimentally, it is often preferable to enhance the 

30 immunogenicity of a polypeptide by such techniques as polymerization with glutaraldehyde, or 
combining with an adjuvant, such as Freuncfs adjuvant. The immunogen is injected into a suitable 
experimental animal: preferably a rodent for the preparation of monoclonal antibodies; preferably a 
larger animal such as a rabbit or sheep for preparation of polyclonal antibodies. It is preferable to 
provide a second or booster injection after about 4 weeks, and begin harvesting the antibody source no 

35 less than about 1 week later. 

Sera harvested from the immunized animals provide a source of polyclonal antibodies. Detailed 
procedures for purifying specific antibody activity from a source material are known within the art. If 
desired, the specific antibody activity can be further purified by such techniques as protein A 
chromatography, ammonium sulfate precipitation, ion exchange chromatography, high-performance 
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liquid chromatography and immunoaffinity chromatography on a column of the immunizing polypeptide 
coupled to a solid support 

Polyclonal antibodies raised by immunizing with an intact Glycoprotein B or a fragment 
comprising conserved sequences may be cross-reactive between herpes viruses. Antibodies that are 
5 virus or subfamily specific may be raised by immunizing with a suitably specific antigen, such as those 
listed above in Table 8. Alternatively, polyclonal antibodies raised against a larger fragment may be 
rendered specific by removing unwanted activity against other virus Glycoprotein B's, for example, by 
passing the antibodies over an adsorbent made from Glycoprotein B and collecting the unbound 
fraction. 

10 Alternatively, immune cells such as splenocytes can be recovered from the immunized animals 

and used to prepare a monoclonal antibody-producing cell line. See, for example, Harrow & Lane 
(1988), U.S. Patent Nos. 4,472,500 (Milstein et ah), and U.S. 4,444,887 (Hoffman et al.). 

Briefly, an antibody-producing line can be produced inter alia by cell fusion, or by transforming 
antibody-producing cells with Epstein Barr Virus, or transforming with oncogenic DIMA. The treated 

15 cells are cloned and cultured, and clones are selected that produce antibody of the desired specificity. 
Specificity testing can be performed on culture supematants by a number of techniques, such as using 
the immunizing polypeptide as the detecting reagent in a standard immunoassay, or using cells 
expressing the polypeptide in immunohistochemistry. A supply of monoclonal antibody from the 
selected clones can be purified from a large volume of tissue culture supernatant, or from the ascites 

20 fluid of suitably prepared host animals injected with the clone. 

Effective variations of this method include those in which the immunization with the polypeptide 
is performed on isolated cells. Antibody fragments and other derivatives can be prepared by methods 
of standard protein chemistry, such as subjecting the antibody to cleavage with a proteolytic enzyme. 
Genetically engineered variants of the antibody can be produced by obtaining a polynucleotide 

25 encoding the antibody, and applying the general methods of molecular biology to introduce mutations 
and translate the variant. 

Monoclonal antibodies raised by injecting an intact Glycoprotein B or a fragment comprising 
conserved sequences may be cross-reactive between herpes viruses. Antibodies that are virus or 
subfamily specific may be raised by immunizing with a suitably specific antigen, as may be selected 

30 from Table 8. Alternatively, virus-specific clones may be selected from the cloned hybridomas by using 
a suitable antigen,' such as one selected from Class III of Table 8, in the screening process. 

Specific antibodies against herpes virus Glycoprotein B have a number of uses in 
developmental, diagnostic and therapeutic work. For example, antibodies can be used in drug 
screening (see U.S. Patent No. 5,120,639). They may also be used as a component of a passive 

35 vaccine, or for detecting herpes virus in a biological sample and for drug targeting, as described in the 
following sections. 

Antiidiotypes relating to Glycoprotein B may also be prepared. This is accomplished by first 
preparing a Glycoprotein B antibody, usually a monoclonal antibody, according to the aforementioned 
methodology. The antibody is then used as an immunogen in a volunteer or experimental animal to 
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raise an anti-idiotype. The antiidiotype may be either monoclonal or polyclonal, and its development is 
generally according to the methodology used for the first antibody. Selection of the anti-idiotype or 
hybridoma clones expressing anti-idiotype is done using the immunogen antibody as a positive 
selector, and using antibodies of unrelated specificity as negative selectors. Usually, the negative 
5 selector antibodies will be a polyclonal immunoglobulin preparation or a pool comprising monoclonal 
immunoglobulins of the same immunoglobulin class and subclass, and the same species as the 
immunogen antibody. An anti-idiotype may be used as an alternative component of an active vaccine 
against Glycoprotein B. 

1 0 Use of antibodies for detecting Glycoprotein B in biological samples 

Antibodies specific for Glycoprotein B can be used to detect Glycoprotein B polypeptides and 
fragments of viral origin that may be present, for example, in solid tissue samples and cultured cells. 
Immunohistological techniques to cany out such determinations will be obvious to a practitioner of 

15 ordinary skill. Generally, the tissue is preserved by a combination of techniques which may include 
freezing, exchanging into different solvents, fixing with agents such as paraformaldehyde, drying with 
agents such as alcohol, or embedding in a commercially available medium such as paraffin or OCT. A 
section of the sample is suitably prepared and overlaid with a primary antibody specific for the protein. 
The primary antibody may be provided directly with a suitable label. More frequently, the primary 

20 antibody is detected using one of a number of developing reagents which are easily produced or 
available commercially. Typically, these developing reagents are antiimmunoglobulin or protein A, and 
they typically bear labels which include, but are not limited to: fluorescent markers such as fluorescein, 
enrymes such as peroxidase that are capable of precipitating a suitable chemical compound, electron 
dense markers such as colloidal gold, or radioisotopes such as 125 l. The section is then visualized 

25 using an appropriate microscopic technique, and the level of labeling is compared between the 
suspected virally infected and a control cell, such as cells surrounding the area of infection or taken 
from a remote site. 

Proteins encoded by a Glycoprotein B gene can also be detected in a standard quantitative 
immunoassay. If the protein is secreted or shed from infected cell in any appreciable amount, it may be 
30 detectable in plasma or serum samples. Alternatively, the target protein may be sotubilized or extracted 
from a solid tissue sample. Before quantitating, the protein may optionally be affixed to a solid phase, 
such as by a blot technique or using a capture antibody. 

A number of immunoassay methods are established in the art for performing the quantitation. For 
example, the protein may be mixed with a pre-determined non-limiting amount of the reagent antibody 
35 specific for the protein. The reagent antibody may contain a directly attached label, such as an enzyme 
or a radioisotope, or a second labeled reagent may be added, such as antiimmunoglobulin or protein 
A. For a solid-phase assay, unreacted reagents are removed by washing. For a liquid-phase assay, 
unreacted reagents are removed by some other separation technique, such as filtration or 
chromatography. The amount of label captured in the complex is positively related to the amount of 
40 target protein present in the test sample. A variation of this technique is a competitive assay, in which 
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the target protein competes with a labeled analog for binding sites on the specific antibody. In this 
case, the amount of label captured is negatively related to the amount of target protein present in a test 
sample. Results obtained using any such assay are compared between test samples, and control 
samples from an uninfected source. 

5 

Use of antibodies for drug targeting 

An example of how antibodies can be used in therapy of herpes virus infection is in the specific 
targeting of effector components. Virally infected cells generally display peptides of the virus, especially 

10 proteins expressed on the outside of the viral envelope. The peptide therefore provides a marker for 
infected cells that a specific antibody can bind to. An effector component attached to the antibody 
therefore becomes concentrated near the infected cells, improving the effect on those cells and 
decreasing the effect on uninfected cells. Furthermore, if the antibody is able to induce endocytosis, 
this will enhance entry of the effector into the cell interior. 

15 For the purpose of targeting, an antibody specific for the viral polypeptide (in this case, a region 

of a Glycoprotein B) is conjugated with a suitable effector component, preferably by a covalent or 
high-affinity bond. Suitable effector components in such compositions include radionuclides such as 
131 1, toxic chemicals, and toxic peptides such as diphtheria toxin. Another suitable effector component 
is an arttisense polynucleotide, optionally encapsulated in a liposome. 

20 

Diagnostic kits 

Diagnostic procedures using the polynucleotides, oligonucleotides, peptides, or antibodies of this 
invention may be performed by diagnostic laboratories, experimental laboratories, practitioners, or 

25 private individuals. This invention provides diagnostic kits which can be used in these settings. The 
presence of a herpes virus in the individual may be manifest in a clinical sample obtained from that 
individual as an alteration in the DNA, RNA, protein, or antibodies contained in the sample. An 
alteration in one of these components resulting from the presence of a herpes virus may take the form 
of an increase or decrease of the level of the component, or an alteration in the form of the component. 

30 compared with that in a sample from a healthy individual. The clinical sample is optionally pre-treated 
for enrichment of the target being tested for. The user then applies a reagent contained in the kit in 
order to detect the changed level or alteration in the diagnostic component. 

Each kit necessarily comprises the reagent which renders the procedure specific: a reagent 
polynucleotide, used for detecting target DNA or RNA; a reagent antibody, used for detecting target 

35 protein; or a reagent polypeptide, used for detecting target antibody that may be present in a sample to 
be analyzed. The reagent is supplied in a solid form or liquid buffer that is suitable for inventory 
storage, and later for exchange or addition into the reaction medium when the test is performed. 
Suitable packaging is provided. The kit may optionally provide additional components that are useful in 
the procedure. These optional components include buffers, capture reagents, developing reagents, 
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labels, reacting surfaces, means for detection, control samples, instructions, and interpretive 
information. 

Other members of the RFHV/KSHV subfamily 

RFHV and KSHV are exemplary members of the RFHV/KSHV subfamily. This invention 
embodies polynucleotide sequences encoding Glycoprotein B of other members of the subfamily, as 
defined herein. The consensus-degenerate gamma herpes virus oligonucleotide Type 1 and 2 primers, 
and the methods described herein are designed to be suitable for characterization of the corresponding 

10 polynucleotide fragment of other members of the RFHV/KSHV subfamily. One such member is another 
virus infecting monkeys, designated RFHV2. A segment of the Glycoprotein encoding sequence for this 
vims was cloned from RF tissue obtained from a Macaca mulatta monkey, as described in Example 12. 

In order to identify and characterize other members of the family, reagents and methods of this 
invention are applied to DNA extracted from tissue samples suspected of being infected with such a 

15 virus. 

Suitable sources of DNA for this purpose include biological samples obtained from a wide range 
of conditions occurring in humans and other vertebrates. Preferred are conditions in which the agent is 
suspected of being lymphotrophic, similar to other members of the gamma herpes virus subfamily; for 
example, infectious mononucleosis of non-EBV origin. More preferred are conditions which resemble in 

20 at least one of their clinical or histological features the conditions with which RFHV or KSHV are 
associated. These include: a) conditions in which fibroproliferation is part of the pathology of the 
disease, especially in association with collagen deposition, and especially where the fibrous tissue is 
disorganized; b) conditions involving vascular dysplasia; c) conditions involving malignant 
transformation, especially but not limited to cells of lymphocyte lineage; d) conditions for which an 

25 underlying immunodeficiency contributes to the frequency or severity of the disease; e) conditions 
which arise idiopathically at multiple sites in an organ or in the body as a whole; f) conditions which 
epidemiological data suggests are associated with an infectious or environmental agent. Conditions 
which fulfill more than one of these criteria are comparably more preferred. Some examples of 
especially preferred conditions include retroperitoneal fibrosis, nodular fibromatosis. 

30 pseudosarcomatous fibromatosis, fibrosarcomas, sclerosing mesenteritis, acute respiratory disease 
syndrome, idiopathic pulmonary fibrosis, diffuse proliferative glomerulonephritis of various types, 
gliomas, glioblastomas, gliosis, and all types of leukemias and lymphomas. 

The type of tissue sample used will depend on the clinical manifestations of the condition. 
Samples more likely to contain a virus associated with the condition may be taken from the site involved 

35 in the disease pathology, or to which there is some other evidence of viral tropism. Peripheral blood 
mononuclear cells of an infected individual may also act as a carrier of an RFHV/KSHV subfamily virus. 
KSHV has been detected in PBMC of both Kaposi's Sarcoma (Moore et al. 1995b) and Castleman's 
disease (Dupin et al.). Other suitable sources are cell cultures developed from such sources, and 
enriched or isolated preparations of virus obtained from such sources. For negative control samples. 
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tissue may be obtained from apparently unaffected sites on the same individuals, or from matched 
individuals who apparently do not suffer from the condition. 

The process of identification of members of the RFHV/KSHV subfamily preferably involves the 
use of the methods and reagents provided in this invention, either singularly or in combination. 
5 One method involves amplifying a polynucleotide encoding a herpes virus Glycoprotein B from 

DNA extracted from the sample. This can be performed, for example, by amplifying the polynucleotide 
in a reaction such as a PCR. In one variation, the amplification reaction is primed using broadly specific 
consensus-degenerate Type 1 oligonucleotides, such as those shown in Table 4. This will amplify 
herpes viruses, primarily of the gamma type. Since the RFHV/KSHV subfamily is a subset of gamma 

10 herpes viruses, Glycoprotein B sequences detected by this variation need to be characterized further to 
determine whether they fall into the RFHV/KSHV subfamily. In a second variation, the amplification is 
primed with RFHV or KSHV specific Type 3 oligonucleotides, such as those listed in Table 7, or other 
Glycoprotein B polynucleotide segments taken from these viruses. The amplification is conducted 
under conditions of medium to low stringency, so that the oligonucleotides will cross-hybridize with 

15 related species of viruses. In a more preferred variation, the amplification reaction is primed using 
RFHV/KSHV subfamily specific Type 2 oligonucleotides, such as those listed in Table 6. Under 
appropriate hybridization conditions, these primers will preferentially amplify Glycoprotein B from herpes 
viruses in the subfamily. 

Preferred members of the subfamily detected using a Glycoprotein B polynucleotide probe are 

20 those that are at least 65% identical with the RFHV or KSHV Glycoprotein B nucleotide sequence 
between residues 36 and 354 of SEQ. ID NO:1 or SEQ. ID NO:3. More preferred are those that are at 
least about 67% identical; more preferred are those at least about 70% identical; more preferred are 
those that are at least about 80% identical; even more preferred are those about 90% identical or more. 
Members of the subfamily can also be identified by performing a hybridization assay on the 

25 polynucleotide of the sample, using a suitable probe. The polynucleotide to be tested may optionally be 
amplified before conducting the hybridization assay, such as by using Type 1 or Type 2 
oligonucleotides as primers. The target is then tested in a hybridization reaction with a suitable labeled 
probe. The probe preferably comprises at least 21 nucleotides, preferably at least about 25 
nucleotides, more preferably at least about 50 nucleotides contained the RFHV or KSHV Glycoprotein B 

30 sequence in SEQ. ID NOS:1 and 3. Even more preferably, the probe comprises nucleotides 36-354 of 
SEQ. ID NOS:1 or 3. Other preferred probes include Type 2 oligonucleotides, such as those shown in 
Table 6. Hybridization conditions are selected to permit the probe to hybridize with Glycoprotein B 
polynucleotide sequences from the RFHV/KSHV subfamily, but not previously sequenced herpes 
viruses; particularly sHV1, bHV4, eHV2, mHV68, hEBV. hCMV. hHV6, hVZV, and HSV1. Formation of 

35 a stable duplex with the test polynucleotide under these conditions suggests the presence of a 
polynucleotide in the sample derived from a member of the RFHV/KSHV subfamily. 

Members of the subfamily can also be identified by using a Class II antibody, the preparation of 
which was outlined earlier. A Class II antibody cross-reacts between antigens produced by members of 
the RFHV/KSHV subfamily, but not with other antigens, including those produced by herpes viruses not 
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members of the subfamily. The test for new subfamily memers is performed, for example, by using the 
antibodies in an immunohistochemistry study of tissue sections prepared from individuals with the 
conditions listed above. Positive staining of a tissue section with the antibody suggests the presence of 
Glycoprotein B in the sample from a member of the RFHV/KSHV subfamily, probably because the 
5 tissue is infected with the virus. If, in addition, the tissue section is non-reactive with RFHV and KSHV 
specific Class III antibodies, the Glycoprotein B in the tissue may be derived from another member of 
the subfamily. Similarly, if Class II antibodies are found in the circulation of an individual, the individual 
may have been subject to a present or past infection with a member of the RFHV/KSHV subfamily. 

Once a putative new virus is identified by any of the aforementioned methods, its membership in 

1 0 the RFHV/KSHV subfamily may be confirmed by obtaining and sequencing a region of the Glycoprotein 
B gene of the vims, and comparing it with that of RFHV or KSHV according to the subfamily definition. 
For new members of the RFHV/KSHV subfamily, other embodiments of this invention may be brought 
into play for purposes of detection, diagnosis, and pharmaceutical development Adaptation of the 
embodiments of the invention for a new subfamily member, if required, is expected to be minor in 

1 5 nature, and will be obvious based on the new sequence data, or a matter of routine adjustment. 

Altered forms of Glycoprotein B from the RFHV/KSHV subfamily 

This invention also embodies altered forms of Glycoprotein B of the RFHV/KSHV subfamily. 

20 A number of studies on both naturally occurring and induced mutations of the Glycoprotein B of 

HSV1 and hCMV point to a role of certain regions of the molecule for its the various biochemical 
functions. See, for example, Reschke et al. and Baghian et al. for a role of carboxy-terminal amino 
acids in fusion; Shiu et al. and Pellett et al. for epitopes for neutralizing antibodies; Gage et al. for 
regions of the molecule involved tn syncytium formation; Navarro et al. (1992) for regions involved in 

25 virus penetration and cell-to-cell spread; Quadri et al. and Novarro et al. (1991) for regions involved in 
intracellular transport of Glycoprotein B during biosynthesis. 

Some of the residues described may be conserved between the Glycoprotein B molecules of the 
viruses investigated previously, and the Glycoprotein B molecules described herein. By analogy, 
mutation of the same residue in the Glycoprotein B of the RFHV/KSHV subfamily is expected to have a 

30 similar effect as described for other viruses. Alternatively, functional regions of different Glycoprotein B 
molecules may be combined to produce Glycoprotein B recombinants with altered function. For 
example, replacing the Glycoprotein B gene in a pathogenic virus with that of a non-pathogenic virus 
may reduce the pathogenicity of the recombinant (Kostal et al ). Either mutation and recombination of 
Glycoprotein B of the RFHV/KSHV herpes virus subfamily may lead to attenuated strains, in which 

35 either the infectivity, replication activity, or pathogenicity is reduced. Alterations in the Glycoprotein B 
sequence which have these effects are contemplated in this invention. 

Attenuated strains of herpes viruses are useful, for example, in developing polyvalent vaccines. 
It is desirable, especially in developing countries, to provide prophylactic vaccines capable of 
stimulating the immune system against several potential pathogens simultaneously. Viruses that are 

40 engineered to express immunogenic peptides of several different pathogens may accomplish this 
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purpose. Herpes viruses may be especially suitable vectors, because the targe genome may easily 
accommodate several kilobases of extra DNA encoding the peptides. Ideally, the viral vector is 
sufficiently intact to exhibit some biological activity and attract the attention of the host* s immune 
system, while at the same time being sufficiently attenuated not to cause significant pathology. Thus, 
5 an attenuated virus of the RFHV/KSHV subfamily may be useful as a vaccine against like virulent 
forms, and may be modified to express additional peptides and extend the range of immune protection. 

Another use for attenuated forms of herpes viruses is as delivery vehicles for gene therapy 
(Latchman et al., Glorioso et a).). In order to be effective, polynucleotides in gene therapy must be 
delivered to the target tissue site. In the treatment of fibrotic diseases, malignancies and related 
10 conditions, attenuated viral vectors of the RFHV/KSHV subfamily may be preferable over other 
targeting mechanisms, including other herpes viruses, since they have the means by which to target 
towards the affected tissues. In this embodiment, the virus is first attenuated, and then modified to 
contain the polynucleotide that is desired for gene therapy, such as those that are outlined in a previous 
section. 

15 

Glycoprotein B in RFHV/KSHV subfamily vaccines 

Because of its prominence on the envelope of the infectious virus and infected cells, glycoprotein 
B is predicted to be a useful target for immune effectors. Herpes virus Glycoprotein B is generally 

20 immunogenic, giving rise to antibodies capable of neutralizing the virus and preventing it from entering 
a replicative phase. In addition, Glycoprotein B is capable of eliciting a T-cell response, which may help 
eradicate an ongoing viral infection by attacking sites of viral replication in host cells. 

This invention embodies vaccine compositions and methods for using them in the prevention and 
management of infection by viruses from the RFHV/KSHV subfamily. 

25 One series of embodiments relate to active vaccines. These compositions are designed to 

stimulate an immune response in the individual being treated against Glycoprotein B. They generally 
comprise either the Glycoprotein B molecule, an immunogenic fragment or variant thereof, or a cell or 
particle capable of expressing the Glycoprotein B molecule. Alternatively, they may comprise a 
polynucleotide encoding an immunogenic Glycoprotein B fragment (Horn et al.), preferably in the form 

30 of an expression vector. Polynucleotide vaccines may optionally comprise a delivery vehicle like a 
liposome or viral vector particle, or may be administered as naked DNA. 

Vaccine compositions of this invention are designed in such a way that the immunogenic 
fragment is presented to stimulate the proliferation and/or biological function of the appropriate immune 
cell type. Compositions directed at eliciting an antibody response comprise or encode B cell epitopes, 

35 and may also comprise or encode other elements that enhance uptake and display by 
antigen-presentation cells, or that recruit T cell help. Compositions directed at eliciting helper T cells, 
especially CD4* cells, generally comprise T cell epitopes that can be presented in the context of class II 
histocompatibility molecules. Compositions directed at stimulating cytotoxic T cells and their 
precursors, especially CD8 + cells, generally comprise T cell epitopes that can be presented in the 

40 context of class I histocompatibility molecules. 
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In the protection of an individual against a future exposure with herpes virus, an antibody 
response may be sufficient. Prophylactic compositions preferably comprise components that elicit a B 
cell response. Successful eradication of an ongoing herpes virus infection may involve the participation 
of cytotoxic T cells, T helper-inducer cells, or both. Infections for treating ongoing infection preferably 
5 comprise components capable of eliciting both T helper cells and cytotoxic T cells. Compositions that 
preferentially stimulate Type 1 helper (T H1 ) cells over Type 2 helper (T^) cells are even more preferred. 
The preparation and testing of suitable compositions for active vaccines is outlined in the sections that 
follow. 

Another series of embodiments relates to passive vaccines and other materials for adoptive 

10 transfer. These compositions generally comprise specific immune components against Glycoprotein B 
that are immediately ready to participate in viral neutralization or eradication. Therapeutic methods 
using these compositions are preferred to prevent pathologic consequences of a recent viral exposure. 
They are also preferred in immunocompromized individuals incapable of mounting a sufficient immune 
response to an active vaccine. Such individuals include those with congenital immunodeficiencies, 

15 acquired immunodeficiencies (such as those infected with HIV or on kidney dialysis), and those on 
immunosuppressive therapies, for example, with corticosteroids. 

Suitable materials for adoptive transfer include specific antibody against Glycoprotein B. as 
described below. Also included are the adoptive transfer of immune cells. For example, T cells 
reactive against Glycoprotein B may be taken from a donor individual, optionally cloned or cultured in 

20 vitro, and then transferred to a histocompatible recipient. More preferably, the transferred cells are 
autologous to the recipient and stimulated in vitro. Thus, T cells are purified from the individual to be 
treated, cultured in the presence of immunogenic components of Glycoprotein B and suitable 
stimulatory factors to elicit vims-specific cells, and then readministered. 

Certain compositions embodied herein may have properties of both active and passive vaccines. 

25 For example, Glycoprotein B antibody given by adoptive transfer may confer immediate protection 
against herpes virus, and may also stimulate an ongoing response, through an antiidiotype network, or 
by enhancing the immune presentation of viral antigen. 

Vaccines comprising Glycoprotein B polypeptides 

30 

Specific components of vaccines to stimulate an immune response against Glycoprotein B 
include the intact Glycoprotein B molecule, and fragments of Glycoprotein B that are immunogenic in 
the host. 

Intact Glycoprotein B and longer fragments thereof may be prepared by any of the methods 
35 described earlier, especially purification from a suitable expression vector comprising a Glycoprotein B 
encoding polynucleotide. Isolated Glycoprotein B from other viral strains stimulate a protective immune 
response (See U.S. Patent Nos. 5,171,568: Burke et al ). Preferred fragments comprise regions of the 
molecule exposed on the outside of the intact viral envelope; located within about 650 amino acids of 
the N-terminal of the mature protein. 

6o 
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Gtycosytation of Glycoprotein B Is not required for immunogenicity (O'Donnell et al.). Hence, 
glycosylated and unglycosylated forms of the molecule are equally preferred. Glycosylation may be 
determined by standard techniques; for example, comparing the mobility of the protein in SDS 
polyacn/lamide gel electrophoresis before and after treating with commercially available 
5 endoglycosidase type F or H. 

Smaller fragments of 5-50 amino acids comprising particular epitopes of Glycoprotein B are also 
suitable vaccine components. These may be prepared by any of the methods described earlier; most 
conveniently, by chemical synthesis. Preferred fragments are those which are immunogenic and 
expressed on the outside of the viral envelope. Even more preferred are fragments implicated in a 
1 0 biological function of Glycoprotein B, such as binding to cell surface receptors or penetration of the vims 
into a target cell. 

Immunogenicity of various epitopes may be predicted by algorithms known in the art. Antigenic 
regions for B cell receptors may be determined, for example, by identifying regions of variable polarity 
(Hopp et al., see Example 9). Antigenic regions for T cell receptors may be determined, for example, 

15 by identifying regions capable of forming an amphipathic helix in the presentation groove of a 
histocompatibility molecule. Antigenic regions may also be identified by analogy with Glycoprotein B 
molecules of other viral species. See, e.g., Sanchez-Pescador et al. and Mester et al., for B cell 
epitopes of HSV1; Liu et al. for HLA-restricted helper T cell epitopes of hCMV; and Hanke et al. for 
cytotoxic T lymphocyte epitopes of HSV1 . 

20 Immunogenicity of various epitopes may be measured experimentally by a number of different 

techniques. Generally, these involve preparing protein fragments of 5-20 amino acids in length 
comprising potential antigenic regions, and testing them in a specific bioassay. Fragments may be 
prepared by CNBr and/or proteolytic degradation of a larger segment of Glycoprotein B, and purified, for 
example, by gel electrophoresis and blotting onto nitrocellulose (Demote et al ). Fragments may also 

25 be prepared by standard peptide synthesis (Schumacher et al., Liu et at.). In a preferred method, 
consecutive peptides of 12 amino acids overlapping by 8 residues are synthesized according to the 
entire extracellular domain of the mature Glycoprotein B molecule, using F-Moc chemistry on a nylon 
membrane support (see Example 11). 

Reactivity against the prepared fragment can then be determined in samples from individuals 

30 exposed to the intact virus or a Glycoprotein B component. The individual may have been 
experimentally exposed to the Glycoprotein B component by deliberate administration. Alternatively, 
the individual may have a naturally occurring viral infection, preferably confirmed by a positive 
amplification reaction using a virus-specific oligonucleotide probe to Glycoprotein B or DNA 
Polymerase. Blood samples are obtained from the individual, and used to prepare serum, T cells, and 

35 peripheral blood mononuclear cells (PBMC) by standard techniques. 

Serum may be tested for the presence of Glycoprotein B specific antibody in an enzyme-linked 
immunosorbant assay. For example, peptides attached to a solid support such as a nylon membrane 
are incubated with the serum, washed, incubated with an enzyme-linked antt-immunoglobulin, and 
developed using an enzyme substrate. The presence of antibody against a particular Glycoprotein B 
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peptide is indicated by a higher level of reaction product in the test well than in a well containing an 
unrelated peptide (Example 1 1). 

Lymphocyte preparations may be tested for the presence of Glycoprotein B specific helper T 
cells in a proliferation assay. Approximately 2 x 10 4 helper T cells are incubated with the peptide at 10" 
4 -10-® M in the presence of irradiated autologous or irradiated 10 s PBMC as antigen presenting cells for 
about 3 days. ^HJThymidine is added for about the last 16 h of culture. The cells are then harvested 
and washed. Radioactivity in the washed cells at a level of about 10 fold over those cultured in the 
absence of peptide reflects proliferation of T cells specific for the peptide (Liu et al. j. If desired, cells 
with a CD3V8" phenotype may be cloned for further characterization of the helper T cell response. 

Lymphocyte preparations may be tested for the presence of Glycoprotein B specific cytotoxic T 
cells in a 51 Cr release assay. Targets are prepared by infecting allogeneic cells with a herpes virus 
comprising an expressible Glycoprotein B gene. Alternatively, allogeneic cells transfected with a 
Glycoprotein B expression vector may be used. The targets are incubated with 51 Cr for about 90 min at 
37-C and then washed. About 5 x 10 4 target cells are incubated with lO^-IO* 5 M of the peptide and 
0.1-2 x 10 4 test T cells for about 30 min at 37*C. Radioactivity released into the supernatant at a level 
substantially above that due to spontaneous lysis reflects CTL activity. If desired, cells with a CD3*4*8* 
phenotype may be cloned for further characterization of the CTL response. 

Glycoprotein B peptides may optionally be combined in a vaccine with other peptides of the 
same virus. Suitable peptides include peptides of any of the other components of the herpes virus, 
such as Glycoproteins C, D, H, E, I, J, and G. Glycoprotein B peptides may also optionally be 
combined with immunogenic peptides from different viruses to provide a multivalent vaccine against 
more than one pathogenic organism. Peptides may be combined by preparing a mixture of the 
peptides in solution, or by synthesizing a fusion protein in which the various peptide components are 
linked. 

Forms of Glycoprotein B comprising suitable epitopes may optionally be treated chemically to 
enhance their immunogenidty, especially if they comprise 100 amino acids or less. Such treatment 
may include cross-linking, for example, with glutaraldehyde; linking to a protein carrier, such as keyhole 
limpet hemocyanin (KLH) or tetanus toxoid. 

The peptide or peptide mixture may be used neat, but normally will be combined with a 
physiologically and pharmacologically acceptable excipient, such as water, saline, physiologically 
buffered saline, or sugar solution. , 

In a preferred embodiment, an active vaccine also comprises an adjuvant which enhances 
presentation of the immunogen or otherwise accentuates the immune response against the 
immunogen. Suitable adjuvants include alum, aluminum hydroxide, beta-2 microglobulin (WO 
91/16924: Rock et al.), muramyl dipeptides, muramyl tripeptides (U.S. Patent No. 5,171,568: Burke et 
al.), and monophosphoryl lipid A (U.S. Patent No. 4,436,728: Ribi et al.; and WO 92/16231: Francotte et 
al.). Immunomodulators such as Interfeukin 2 may also be present. The peptide and other components 
(if present) are optionally encapsulated in a liposome or microsphere. For an outline of the 
experimental testing of various adjuvants, see U.S. Patent No. 5,171,568 (Burke et al.). A variety of 
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adjuvants may be efficacious. The choice of an adjuvant will depend at least in part on the stability of 
the vaccine in the presence of the adjuvant, the route of administration, and the regulatory acceptability 
of the adjuvant, particularly when intended for human use. 

Polypeptide vaccines generally have a broad range of effective latitude. The usual route of 
5 administration is intramuscular, but preparations may also be developed which are effective given by 
other routes, including intravenous, intraperitoneal, oral, intranasal, and by inhalation. The total amount 
of Glycoprotein B polypeptide per dose of vaccine when given intramuscularly will generally be about 10 
jig to 5 mg; usually about 50 jig to 2 mg; and more usually about 100 to 500 fig. The vaccine is 
preferably administered first as a priming dose, and then again as a boosting dose, usually at least four 
10 weeks later. Further boosting doses may be given to enhance or rejuvenate the response on a periodic 
basis. 

Vaccines comprising viral particles expressing Glycoprotein B 

15 Active vaccines may also be prepared as particles that express an immunogenic epitope of 

Glycoprotein B. 

One such vaccine comprises the L-particle of a recombinant herpes virus (see U.S. Patent No. 
5,284,122: Cunningham et al.). The genome of the recombinant vims is defective in a capsid 
component, or otherwise prevented from forming intact virus; however, it retains the ability to make 

20 L-particies. The genome is engineered to include a Glycoprotein B encoding polynucleotide of the 
present invention operatively linked to the controlling elements of the recombinant virus. The vims is 
then grown, for example, in cultured cells, and the particles are purified by centrifugation on a suitable 
gradient, such as FICOLL™. Such preparations are free of infective vims, and capable of expressing 
peptide components of a number of different desirable epitopes. 

25 Another such vaccine comprises a live virus that expresses Glycoprotein B of the present 

invention as a heterologous antigen. Such viruses include HIV, SIV, FIV, equine infectious anemia, 
visna vims, and herpes viruses of other species. The vims should be naturally non-pathogenic in the 
species to be treated; or alternatively, it should be attenuated by genetic modification, for example, to 
reduce replication or virulence. Herpes vims may be attenuated by mutation of a gene involved in 

30 replication, such as the DNA Polymerase gene. Herpes vims may also be attenuated by deletion of an 
essential late-stage component, such as Glycoprotein H (WO 92/05263: Inglis et al.). A live vaccine 
may be capable of a low level of replication in the host, particularly if this enhances protein expression, 
but not to the extent that it causes any pathological manifestation in the subject being treated. 

A preferred viral species for preparing a live vaccine is adenovirus. For human therapy, human 

35 adenovirus types 4 and 7 have been shown to have no adverse affects, and are suitable for use as 
vectors. Accordingly, a Glycoprotein B polynucleotide of the present invention may be engineered, for 
example, into the E1 or E3 region of the viral genome. It is known that adenovirus vectors expressing 
Glycoprotein B from HSV1 or HSV2 stimulate the production of high titer vims-neutralizing antibody 
(McDermott et al.). The response protects experimental animals against a lethal challenge with the 

40 respective live vims. 
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Also preferred as a virus for a live recombinant vaccine is a recombinant pox vims, especially 
vaccinia. Even more preferred are strains of vaccinia virus which have been modified to inactivate a 
non-essential virulence factor, for example, by deletion or insertion of an open reading frame relating to 
the factor (U.S. Patent No. 5,364,773: Paoletti et al.). To prepare the vaccine, a Glycoprotein B 
encoding polynucleotide of the present invention is genetically engineered into the viral genome and 
expressed under control of a vaccinia virus promoter. Recombinants of this type may be used directly 
for vaccination at about 10 7 -10 B plaque-forming units per dose. Single doses may be sufficient to 
stimulate an antibody response. Vaccinia vims recombinants comprising Glycoprotein B of HSV1 are 
effective in protecting mice against lethal HSV1 infection (Cantin et al.). 

Another vaccine in this category is a self-assembling replication-defective hybrid vims. See, for 
example, WO 92/05263 (Inglis et al.). The particle may contain, for example, capsid and envelope 
glycoproteins, but not an intact viral genome. As embodied in this invention, one of the glycoproteins in 
the viral envelope is Glycoprotein B. 

In a preferred embodiment, the particle is produced by a viral vector of a first species, having a 
sufficient segment of the genome of that species to replicate, along with encoding regions for a capsid 
and an envelope from a heterologous species (see U.S. Patent No. 5,420,026: Payne). Genetic 
elements of the first species are selected such that infection of eukaryotic cells with the vector produces 
capsid and envelope glycoproteins that self-assemble into replication-defective particles. In a variant of 
this embodiment, polynucleotides encoding the capsid and envelope glycoproteins are provided in two 
separate vectors derived from the first viral species. The capsid encoding regions may be derived from 
a lentivirus, such as HIV, SIV. FIV, equine infectious anemia virus, or visna virus. The envelope 
encoding regions comprise a Glycoprotein B encoding polynucleotide of the present invention. 
Preferably, all envelope components are encoded by a herpes virus, particularly of the RFHV/KSHV 
subfamily. The defective viral particles are obtained by infecting a susceptible eukaryotic cell line such 
as BSC-40 with the vectors) and harvesting the supernatant after about 18 hours. Viral particles may 
be further purified, if desired, by centrtfugation through a sucrose cushion. Particles may also be 
treated with 0.8% formalin at 40*C for 24 hours prior to administration as a vaccine. 

Vaccines comprising a live attenuated vims or vims analog may be lyophilized for refrigeration. 
Diluents may optionally include tissue culture medium, sorbitol, gelatin, sodium bicarbonate, albumin, 
gelatin, saline solution, phosphate buffer, and sterile water. Other active components may optionally be 
added, such as attenuated strains of measles, mumps, and rubella, to produce a polyvalent vaccine. 
The suspension may be lyophilized, for example, by the gas injection technique. This is performed by 
placing vials of vaccine in a lyophilizing chamber precooled to about -45*C with 10-18 Pa of dry sterile 
argon, raising the temperature about 5-25*0 per h to +30*C, conducting a second lyophilizing cycle with 
full vacuum, and then sealing the vials under argon in the usual fashion (see EP 02901 97B1 : Mcaleer et 
al.). For vaccines comprising live herpes vims, the final lyophilized preparation will preferably contain 
2-8% moisture. 

It is recognized that a number of alternative compositions for active vaccines, not limited to those 
described here in detail, may be efficacious in eliciting specific B- and T-cell immunity. All such 
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compositions are embodied in the spirit of the present invention, providing they include a RFHV7KSHV 
subfamily Glycoprotein B polynucleotide or polypeptide as an active ingredient 

Vaccines comprising Glycoprotein B antibodies 

5 

Antibody against Glycoprotein B of the RFHV7KSHV subfamily may be administered by adoptive 
transfer to immediately confer a level of humoral immunity in the treated subject. Passively 
administered anti-glycoprotein B experimentally protects against a lethal challenge with other herpes 
viruses, even in subjects with compromised T-cell immunity (Eis-Hubinger et al.). 

10 The antibody molecule used should be specific for Glycoprotein B against which protection is 

desired. It should not cross-reactive with other antigens, particularly endogenous antigens of the 
subject to be treated. The antibody may be specific for the entire RFHV/KSHV subfamily (Class II 
antibodies), or for a particular virus species (Class III antibodies), depending on the objective of the 
treatment. Preferably, the antibody will have an overall affinity for a polyvalent antigen of at least about 

15 10 s NT 1 ; more preferably it will be at least about 10 to M" 1 ; more preferably it will be at least about 10 12 
NT 1 ; even more preferably, it will be 10 13 NT 1 or more. Intact antibody molecules, recombinants, fusion 
proteins, or antibody fragments may be used; however, intact antibody molecules or recombinants able 
to express natural antibody effector functions are preferred. Relevant effector functions include but are 
not limited to virus aggregation; antibody-dependent cellular cytotoxicity; complement activation; and 

20 opsonization. 

Antibody may be prepared according to the description provided in an earlier section. For 
systemic protection, the antibody is preferably monomelic, and preferably of the IgG class. For 
mucosal protection, the antibody may be polymeric, preferably of the IgA class. The antibody may be 
either monoclonal or polyclonal; typically, a cocktail of monoclonal antibodies is preferred. It is also 
25 preferred that the preparation be substantially pure of other biological components from the original 
antibody source. Other antibody molecules of desired reactivity, and carriers or stabilizers may be 
added after purification. 

In some instances, It is desirable that the antibody resemble as closely as possible an antibody 
of the species which is to be treated. This is to prevent the administered antibody from becoming itself 

30 a target of the recipient's immune response. Antibodies of this type are especially desirable when the 
subject has an active immune system, or when the antibodies are to be administered in repeat doses. 

Accordingly, this invention embodies anti-Glycoprotein B antibody which is human, or which has 
been humanized. Polyclonal human antibody may be purified from the sera of human individuals 
previously infected with the respective RFHV/KSHV subfamily herpes virus, or from volunteers 

35 administered with an active vaccine. Monoclonal human antibody may be produced from the 
lymphocytes of such individuals, obtained, for example, from peripheral blood. In general, human 
hybridomas may be generated according to the methods outlined earlier. Usually, the production of 
stable human hybridomas will require a combination of manipulative techniques, such as both fusion 
with a human myeloma cell line and transformation, for example, with EBV. 
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In a preferred method, human antibody is produced from a chimeric non-primate animal with 
functional human immunoglobulin loci (WO 91/10741: Jakobovits et al.). The non-primate animal strain 
(typically a mouse) is incapable of expressing endogenous immunoglobulin heavy chain, and optimally 
at least one endogenous immunoglobulin light chain. The animals are genetically engineered to 
express human heavy chain, and optimally also a human light chain. These animals are immunized 
with a Glycoprotein B of the RFHV/KSHV subfamily of herpes viruses. Their sera can then be used to 
prepare polyclonal antibody, and their lymphocytes can be used to prepare hybridomas in the usual 
fashion. After appropriate selection and purification, the resultant antibody is a human antibody with the 
desired specificity. 

In another preferred method, a monoclonal antibody with the desired specificity for Glycoprotein 
B is first developed in another species, such as a mouse, and then humanized. To humanize the 
antibody, the polynucleotide encoding the specific antibody is isolated, antigen binding regions are 
obtained, and then recombined with polynucleotides encoding elements of a human immunoglobulin of 
unrelated specificity. Alternatively, the nucleotide sequence of the specific antibody is obtained and 
used to design a related sequence with human characteristics, which can be prepared, for example, by 
chemical synthesis. The heavy chain constant region or the light chain constant region of the specific 
antibody, preferably both, are substituted with the constant regions of a human immunoglobulin of the 
desired class. Preferably, segments of the variable region of both chains outside the complementarity 
determining regions (COR) are also substituted with their human equivalents (EP 0329400: Winter). 

Even more preferably, segments of the variable region are substituted with their human 
equivalents, providing they are not involved either in antigen binding or maintaining the structure of the 
binding site. Important amino acids may be identified, for example, as described by Padlan. In one 
particular technique (WO 94/11509: Couto et al.), a positional consensus sequence is developed using 
sequence and crystallography data of known immunoglobulins: The amino acid sequence of the 
Glycoprotein B specific antibody is compared with the model sequence, and amino acids involved in 
antigen binding, contact with CDR's. or contact with opposing chains are identified. The other amino 
acids are altered, where necessary, to make them conform to a consensus of human immunoglobulin 
sequences. A polynucleotide encoding the humanized sequence is then prepared, transfected into a 
host cell, and used to produce humanized antibody with the same Glycoprotein B specificity as the 
originally obtained antibody clone. 

Specific antibody obtained using any of these methods is generally sterilized, mixed with a 
pharmaceutical^ compatible excipient. Stabilizers such as 0.3 molar glycine, and preservatives such 
as 1:10,000 thimerosal, may also be present. The suspension may be buffered to neutral pH (-7.0), for 
example, by sodium carbonate. The potency may optionally be adjusted by the addition of normal 
human IgG, obtained from large pools of normal plasma, for example, by the Cohn cold ethanol 
fractionation procedure. Other diluents, such as albumin solution, may be used as an alternative. The 
concentration is adjusted so that a single dose administration constitutes 0.005-0.2 mg/kg, preferably 
about 0.06 mg/kg. A single dose preferably results in a circulating level of anti-Glycoprotein B, as 
detected by ELISA or other suitable technique, which are comparable to those observed in individuals 
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who have received an active Glycoprotein B vaccine or have recovered from an acute infection with the 
corresponding virus, or which are known from experimental work to be protective against challenges 
with a pathologic dose of virus. 

Administration should generally be performed by intramuscular injection, not intravenously, and 
5 care should be taken to assure that the needle is not in a blood vessel. Special care should be taken 
with individuals who have a history of systemic allergic reactions following administration of human 
globulin. For prophylactic applications, the antibody preparation may optionally be administered in 
combination with an active vaccine for Glycoprotein B, as described in the preceding sections. For 
post-exposure applications, the antibody preparation is preferably administered within one week of the 

10 exposure, more preferably within 24 hours, or as soon as possible after the exposure. Subsequent 
doses may optionally be given at approximately 3 month intervals. 

As for all therapeutic instruments described herein, the amount of composition to be used, and 
the appropriate route and schedule of administration, will depend on the clinical status and 
requirements of the particular individual being treated. The choice of a particular regimen is ultimately 

15 the responsibility of the prescribing physician or veterinarian. 

The foregoing description provides, inter alia, a detailed explanation of how Glycoprotein B 
encoding regions of herpes viruses, particularly those of the RFHV/KSHV subfamily, can be identified 
and their sequences obtained. Polynucleotide sequences for encoding regions of Glycoprotein B of 

20 both RFHV and KSHV are provided. 

The polynucleotide sequences listed herein for RFHV and KSHV are believed to be an accurate 
rendition of the sequences contained in the polynucleotides from the herpes viruses in the tissue 
samples used for this study. They represent a consensus of sequence data obtained from multiple 
clones. However, it is recognized that sequences obtained by amplification methods such as PCR may 

25 comprise occasional errors in the sequence as a result of amplification. The error rate is estimated to 
be between about 0.44% and 0.75% for single determinations; about the same rate divided by V(n-1) for 
the consensus of n different determinations. Nevertheless, the error rate may be as high as 1% or 
more. Sequences free of amplification errors can be obtained by creating a library of herpes virus 
polynucleotide sequences, using oligonucleotides such as those provided in Table 7 to select relevant 

30 clones, and sequencing the DNA in the selected clones. The relevant methodology is well known to a 
practitioner of ordinary skill in the art, who may also wish to refer to the description given in the Example 
section that follows. 

ft is recognized that allelic variants and escape mutants of herpes viruses occur. 
Polynucleotides and polypeptides may be isolated or derived that incorporate mutations, either naturally 
35 occurring, or accidentally or deliberately induced, without departing from the spirit of this invention. 

The examples presented below are provided as a further guide to a practitioner of ordinary skill 
in the art, and are not meant to be limiting in any way. 
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Examples 

Example 1: Oligonucleotide primers for Herpes Virus Glycoprotein B 

5 Amino acid sequences of known herpes virus Glycoprotein B molecules were obtained from the 

PIR protein database, or derived from DNA sequences obtained from the GenBank database. The 
sequences were aligned by computer-aided alignment programs and by hand. 

Results are shown in Figure 3. sHV1, bHV4, mHV68, EBV and hHV6 sequences were used to 
identity regions that were relatively well conserved, particularly amongst the gamma herpes viruses. 
10 Nine regions were chosen for design of amplification primers. The DNA sequences for these regions 
were then used to design the oligonucleotide primers. The primers were designed to have a 
degenerate segment of 8-14 base pairs at the 3' end, and a consensus segment of 18-30 bases at the 
5* end. This provides primers with optimal sensitivity and specificity. 

The degenerate segment extended across highly conserved regions of herpes virus Glycoprotein 
15 B sequences, encompassing the least number of alternative codons. The primers could therefore be 
synthesized with alternative nucleotide residues at the degenerate positions and yield a minimum 
number of combinations. There were no more than 256 alternative forms for each of the primers 
derived. 

The consensus segment was derived from the corresponding flanking region of the Glycoprotein 
20 B sequences. Generally, the consensus segment was derived by choosing the most frequently 
occurring nucleotide at each position of all the Glycoprotein B sequences analyzed. However, selection 
was biased in favor of C or G nucleotides, to maximize the ability of the primers to form stable duplexes. 

Results are shown in Figures 4-12, and are summarized in Table 4. In a PCR, oligonucleotides 
listed in Table 4 as having a "sense" orientation would act as primers by hybridizing with the strand 
25 antisense to the coding strand, and initiating polymerization in the same direction as the Glycoprotein B 
encoding sequence. Oligonucleotides listed in Table 4 as having an -antisense" orientation would 
hybridize with the coding strand and initiate polymerization in the direction opposite to that of the 
Glycoprotein B encoding sequence. 

Synthetic oligonucleotides according to the designed sequences were ordered and obtained from 
30 Oligos Etc, Inc. 

Example 2: DNA extraction 

Biopsy specimens were obtained from Kaposi's sarcoma lesions from human subjects 
35 diagnosed with AIDS. The specimens were fixed in paraformaldehyde and embedded in paraffin, which 
were processed for normal histological examination. 

Fragments of the paraffin samples were extracted with 500 \iL of xylene in a 1.5 mL 
EPPENDORF™ conical centrifuge tube. The samples were rocked gently for 5 min at room 
temperature, and the tubes were centrifuged in an EPPENDORF™ bench-top centrifuge at 14.000 rpm 
40 for 5 min. After removing the xylene with a Pasteur pipette, 500 nL of 95% ethanol was added, the 
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sample was resuspended, and then re-centrifuged. The ethanol was removed, and the wash step was 
repeated. Samples were then air-dried for about 1 hour. 500 nL of proteinase-K buffer (0.5% TWEEN™ 
20, a detergent: 50 mM Tris buffer pH 7.5, 50 mM NaCI) and 5 nL of proteinase K (20 mg/mL) were 
added, and the sample was incubated for 3 h at 55*C. The proteinase K was inactivated by incubating 
5 at95*Cfor10min. 

Samples of DNA from KS tissue were pooled to provide a consistent source of polynucleotide for 
the amplification reactions. This pool was known to contain DNA from KSHV, as detected by 
amplification of KSHV DNA polymerase sequences, as described in commonly owned U.S. Patent 
Application 60/001 ,148. 

10 

Example 3: Obtaining amplified segments of KSHV Glycoprotein B 

The oligonucleotides obtained in Example 1 were used to amplify segments of the DNA 
extracted from KSHV tissue in Example 2, according to the following protocol. 

15 A first PCR reaction was conducted using 2 \iL of pooled DNA template, 1 jiL of oligonucleotide 

FRFDA (50 pmol/tiL), 1 \± of oligonucleotide TVNCB (50 pmol/jiL), 10 >iL of 10x buffer, 1 jiL containing 
2.5 mM of each of the deoxyribonudeotide triphosphates (dNTPs), 65 jiL distilled water, and 65 |iL 
mineral oil. The mixture was heated to 80*C in a Perkin-Elmer (model 480) PCR machine. 0.5 jiL Taq 
polymerase (BRL, 5 U/^L) and 19.5 jiL water was then added. 35 cycles of amplification were 

20 conducted in the following sequence: 1 min at 94*C, 1 min at the annealing temperature, and 1 min at 
72*C. The annealing temperature was 60*C in the first cycle, and decreased by 2°C each cycle until 
50*C was reached. The remaining cycles were performed using 50*C as the annealing temperature. 

A second PCR reaction was conducted as follows: to 1 pL of the reaction mixture from the 
previous step was added 1 jiL oligonucleotide NIVPA (50 pmol/pL), 1 nL oligonucleotide TVNCB (50 

25 pmol/jiL), 10 jiL of 1 0x buffer, 1 pL dNTPs, 66 jiL water, and 65 \xl mineral oil. The mixture was heated 
to 80*C, and 0.5 nL Taq polymerase in 10.5 jiL water was added. 35 cycles of amplification were 
conducted using the same temperature step-down procedure as before. The PCR product was 
analyzed by electrophoresing on a 2% agarose gel and staining with ethidium bromide. 

The two-round amplification procedure was performed using fourteen test buffers. Five buffers 

30 yielded PCR product of about the size predicted by analogy with other herpes sequences. These 
included WB4 buffer (10x WB4 buffer is 0.67 M Tris buffer pH 8.8. 40 mM MgCI 2 , 0.16 M (NH 4 ) 2 S0 4t 0.1 
M p-mercaptoethanol, 1 mg/mL bovine serum albumin, which is diluted 1 to 10 in the reaction). Also 
tested was WB2 buffer (the same as WB4 buffer, except with 20 mM MgCI 2 in the lOx concentrate). 
Also tested were buffers that contained 10 mM Tris pH 8.3, 3.5 mM MgCI 2 and 25 mM KCI; or 10 mM 

35 Tris pH 8.3, 3.5 mM MbCI 2 and 75 mM KCI; or 10 mM Tris pH 8.8, 3.5 mM MgCI 2 and 75 mM KCI; when 
diluted to final reaction volume. The WB4 buffer showed the strongest band, and some additional 
fainter bands. This may have been due to a greater overall amount of labeled amplified polynucleotide 
in the WB4 sample. 

The product from amplification with WB2 buffer was selected for further investigation. A third 
40 round of amplification was performed to introduce a radiolabel. The last-used oligonucleotide (TVNCB) 
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is end-labeled with gamma M P-ATP, and 1 was added to 20 mL of the reaction mixture from the 
previous amplification step, along with 1 jiL 2.5 mM dNTP. The mixture was heated to 8CTC, and 0.5 nL 
Taq polymerase was added. Amplification was conducted through five cycles of 94"C ( 60*C and 72*C. 
The reaction was stopped using 8.8 jiL of loading buffer from a Circumvent sequencing gel kit. 

A -4 nL aliquot of the radiolabeled reaction product was electrophoresed on a 6% 
polyacrylamide sequencing gel for 1.5 h at 51*C. The gel was dried for 1.5 h, and an autoradiograph 
was generated by exposure for 12 h. Two bands were identified. The larger band had the size 
expected for the fragment from analogy with other gamma herpes virus sequences. 

The larger band was marked and cut out, and DNA was eluted by incubation in 40 \± TE buffer 
(10 mM Tris and 1 mM EDTA, pH. 8.0). A further amplification reaction was performed on the extracted 
DNA, using 1 jiL of the eluate, 10 >iL 10x WB2 butter, 1 jiL 2.5 mM dNTP, 1 nL of each of the second 
set of oligonucleotide primers (NIVPA and TVNCB), and 65 \iL water. The mixture was heated to 80*C. 
and 0.5 jiL Taq polymerase in 19.5 \xL water was added. Amplification was conducted through 35 
cycles, using the temperature step-down procedure described earlier. 

Example 4: Sequence of the 386 base fragment of KSHV Glycoprotein B 

The amplified polynucleotide fragment from the Glycoprotein B gene of KSHV was purified and 
cloned according to the following procedure. 
20 40 |iL of amplification product was run on a 2% agarose gel, and stained using 0.125 fig/mL 

ethydium bromide. The single band at about 400 base pairs was cut out, and purified using a 
QIAGEN™ II gel extraction kit, according to manufacturer's instructions. 

The purified PCR product was ligated into the pGEM™-t cloning vector. The vector was used to 
transform competent bacteria (E. coli JM-109). Bacterial clones containing the amplified DNA were 
25 picked and cultured. The bacteria were lysed and the DNA was extracted using phenol-chloroform 
followed by precipitation with ethanol. Colonies containing inserts of the correct size were used to 
obtain DNA for sequencing. The done inserts were sequenced from both ends using vector-specific 
oligonucleotides (forward and reverse primers) with a SEQUENASE™ 7-deaza dGTP kit, according to 
manufacturer's directions. A consensus sequence for the new fragment was obtained by combining 
30 sequence data obtained from 5 clones of one KSHV Glycoprotein B amplification product. 

The length of the fragment in between the primer hybridizing regions was 319 base pairs. The 
nucleotide sequence is listed as SEQ. ID NO:3 and shown in Figure 1. The encoded polypeptide 
sequence is listed as SEQ. ID NO:4. 

Figure 13 compares the sequence of this Glycoprotein B gene fragment with the corresponding 
35 sequence of other gamma herpes viruses. Single dots (.) indicate residues in other gamma herpes 
viruses that are identical to those of the KSHV sequence. Dashes (-) indicate positions where gaps 
have been added to provide optimal alignment of the encoded protein. The longest stretch of 
consecutive nucleotides that is identical between the KSHV sequence and any of the other listed 
sequences is 14. Short conserved sequences are scattered throughout the fragment Overall, the 
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polynucleotide fragment is 63% identical between KSHV and the two closest herpes virus sequences, 
sHV1 and bHV4. 

The sequence data was used to design Type 3 oligonucleotide primers of 20-40 base pairs in 
length. The primers were designed to hybridize preferentially with the KSHV Glycoprotein B 
5 polynucleotide, but not with other sequenced polynucleotides encoding Glycoprotein B. Example 
primers of this type were listed earlier in Table 7. 

Figure 14 compares the predicted amino add sequence encoded by the same Glycoprotein B 
gene fragment. At the amino acid level, two short segments are shared between KSHV and a 
previously known gamma herpes virus, bHV4. The first (SEQ. ID N0.64) is 13 amino adds in length 
10 and located near the N-terminal end of the fragment. The second (SEQ. ID NO:65) is 1 5 amino acids in 
length and located near the C-terminal end of the fragment. All other segments shared between KSHV 
and other gamma herpes viruses are 9 amino acids or shorter. 

Example 5; Sequence of the 386 base fragment of RFHV Glycoprotein B 

15 

Tissue specimens were obtained from the tumor of a Macaque nemestrina monkey at the 
University of Washington Regional Primate Research Center. The specimens were fixed in 
paraformaldehyde and embedded in paraffin. DNA was extracted from the specimens according to the 
procedure of Example 2. 

20 The presence of RFHV polynucleotide in DNA preparations was determined by conducting PCR 

amplification reactions using oligonucleotide primers hybridizing to the DNA polymerase gene. Details 
of this procedure are provided in commonly owned U.S. Patent Application 60/001,148. DNA extracts 
containing RFHV polynucleotide determined in this fashion were pooled for use in the present study. 
DNA preparations containing RFHV polynucleotide served as the template in PCR amplification 

25 reactions using Glycoprotein B consensus-degenerate oligonucleotides FRFDA and TVNCB, followed 
by a second round of amplification using oligonucleotides Nh/PA and TVNCB. Conditions were 
essentially the same as in Example 3, except that only WB4 buffer produced bands of substantial 
intensity, with the amount of DNA in the initial source and the conditions used. Labeling of the amplified 
DNA was performed with M P end-labeled NIVPA, as before; the product was electrophoresed on a 6% 

30 polyacrylamide gel, and an autoradiogram was obtained. A ladder of bands corresponding to about 386 
base pairs and about 10 higher mol wt concatemers was observed. The 386 base pair band (with the 
same mobility as a simultaneously run KSHV fragment) was cut out of the gel and extracted. 

To determine whether the DNA in this extract was obtained from a specific amplification reaction, 
PCR's were set up using NIVPASQ alone, TVNCBSQ alone, or the two primers together. Buffer 

35 conditions were the same as for the initial amplification reactions. The mixture was heated to 80"C, Taq 
polymerase was added, and the amplification was carried through 35 cycles using the temperature 
step-down procedure. Theoretically, specific amplification reactions accumulate product linearly when 
one primer is used, and exponentially when using two primers with opposite orientation. Thus, 
specificity is indicated by more product in the reaction using both primers, whereas equal product in all 

40 three mixtures suggests non-specific amplification. Amplification products from these test reactions 
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were analyzed on an agarose gel stained with ethidium bromide. The RF extract showed no product for 
the NIVPASQ reaction, a moderate staining band for the TVNCBSQ reaction at the appropriate mobility, 
and an intensely staining band for both primers together. For a KSHV fragment assayed in parallel, 
there was a faint band for the NIVPASQ reaction, no band for the TVNCBSQ reaction, and an intensely 
5 staining band for both primers together. We concluded that the 386 base pair band in the RF extract 
represented specific amplification product. 

Accordingly, 40 \iL of the RF extract that had been amplified with both primers was run 
preparatively on a 2% agarose gel, and the -386 base pair band was cut out. Agarose was removed 
using a QIAGEN™ kit, and the product was cloned in E. coli and sequenced as in Example 4. A 
10 consensus sequence was determined for 3 different clones obtained from the same amplified RFHV 
product. 

The polynucleotide sequence of RFHV Glycoprotein B fragment (SEQ. ID NO:1) is aligned in 
Figure 1 with the corresponding sequence from KSHV. Also shown is the encoded RFHV amino acid 
sequence (SEQ. ID NO:2). Between the primer hybridization regions (nucleotides 36-354), the 
15 polynucleotide sequences are 76% identical; and the amino acid sequences are 01% identical. The 
internal cysteine residue and the potential N-linked glycosylation site are both conserved between the 
two viruses. 

The sequence data was used to design Type 3 oligonucleotide primers of 20-40 base pairs in 
length. The primers were designed to hybridize preferentially with the RFHV Glycoprotein B 
20 polynucleotide, but not with other sequenced polynucleotides encoding Glycoprotein B. Example 
primers of this type were listed earlier in Table 7. 

Figure 15 compares the predicted amino acid sequence encoded by nucleotides 36-354 of the 
Glycoprotein B gene fragment. As for the KSHV sequence, two short segments are shared between 
RFHV and a previously known gamma herpes virus, bHV4. All other segments shared between RFHV 
25 and other gamma herpes viruses are shorter than 9 amino acids in length. 

Figure 16 is an alignment of sequence data for the same Glycoprotein B fragment in the 
spectrum of herpes viruses for which data is available. Figure 17 shows the phylogenetic relationship 
between herpes viruses, based on the degree of identity across the partial Glycoprotein B amino acid 
sequences shown in Figure 16. By amino acid homology, amongst the viruses shown, RFHV and 
30 KSHV are most closely related to bHV4, eHV2, and sHV1 . 

Example 6; Oligonucleotide primers and probes for the RFHV/KSHV subfamily 

Based on the polynucleotide fragment obtained for RFHV and KSHV, seven Type 2 
35 oligonucleotides were designed that could be used either as PCR primers or as hybridization probes 
with members of the RFHV/KSHV subfamily. 

Four consensus-degenerate Type 2 oligonucleotides, SHMDA, CFSSB, ENTFA, and DNIQB are 
shown in Figure 17, alongside the sequences they were derived from. Like the oligonucleotides of 
Example 1, they have a consensus segment towards the 5' end, and a degenerate segment towards 
40 the 3* end. However, these oligonucleotides are based only on the RFHV and KSHV sequences, and 
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will therefore preferentially form stable duplexes with Glycoprotein B of the RFHV/KSHV subfamily. A 
list of exemplary Type 2 oligonucleotides was provided earlier in Table 6. 

Different Type 2 oligonucleotides have sense or antisense orientations. Primers with opposing 
orientations may be used together in PCR amplifications. Alternatively, any Type 2 oligonucleotide may 
5 be used in combination with a Type 1 oligonucleotide with an opposite orientation. 

Example 7: Upstream and downstream Glycoprotein B sequence 

Further amplification reactions are conducted to obtain additional sequence data. The source for 
10 KSHV DNA is Kaposi's Sarcoma tissue, either frozen tissue blocks or paraffin-embedded tissue, 
prepared according to Example 2, or cell lines developed from a cancer with a KSHV etiology, such as 
body cavity lymphoma. Also suitable is KSHV that is propagated in culture (Weiss et al.) 

The general strategy to obtain further sequence data in the 5' direction of the coding strand is to 
conduct amplification reactions using the consensus-degenerate (Type 1) oligonucleotide hybridizing 
15 upstream from the fragment as the 5* primer, in combination with the closest virus-specific (Type 3) 
oligonucleotides as the 3* primers. Thus, a first series of amplification cycles are conducted, for 
example, using FRFDA and TNKYB as the first set of primers. This may optionally be followed by a 
second series of amplification cycles, conducted, for example, using FRFDA and GLTEB as a second 
set of primers. 

20 The conditions used are similar to those described in Examples 3 and 4. The reaction is 

performed in WB4 buffer, using the temperature step-down procedure described in Example 3. After 
two rounds of amplification, the product is labeled using the last-used virus-specific oligonucleotide 
(GLTEB, in this case), end-labeled with gamma ^P-ATP. The labeled product is electrophoresed on 
6% potyacrylamide, and a band corresponding to the appropriate size as predicted by analogy with 

25 other herpes viruses is excised. After re-amplification, the product is purified, cloned, and sequenced 
as before. A consensus sequence for the new fragment is obtained by combining results of about three 
determ I nations . 

In order to obtain further sequence data in the 3' direction of the coding strand, amplifications are 
conducted using consensus-degenerate (Type 1) oligonucleotides hybridizing downstream from the 

30 fragment as the 3' primer, in combination with the closest virus-specific (Type 3) oligonucleotides as the 
5* primers. In one example, a first series of amplification cycles are conducted using NVFDB and 
TVFLA, optionally followed by a second series conducted using NVFDB and SQPVA. Amplification and 
sequencing is performed as before. The new sequence is used to design further Type 3 
oligonucleotides with a sense orientation, which are used with other downstream-hybridizing Type 1 

35 oligonucleotides (such as FREYB and NVFDB) to obtain further sequence data. Alternatively, further 
sequence data in the 3* direction is obtained using Type 1 oligonucleotides with opposite orientation: 
for example, two primers are selected from the group of FRFDA, NIVPA, TVNCA, NIDFB, NVFDB, and 
FREYB; additional primers may be selected for nested amplification. 

To obtain sequence data 3' from the most downstream oligonucleotide primer. Type 1 primers 

40 such as CYSRA, or Type 3 primers such as TVFLA, may be used in combination with primers 
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hybridizing towards the 5' end of the DNA polymerase gene. Oligonucleotide primers hybridizing to the 
DNA polymerase gene of herpes viruses related to RFHV and KSHV are described in commonly owned 
U.S. Patent Application 607001,148. The DNA polymerase encoding region is located 3' to the 
Glycoprotein B encoding region. PCRs conducted using this primer combination are expected to 
5 amplify polynucleotides comprising the 3' end of the Glycoprotein B encoding region, any intervening 
sequence if present, and the 5" end of the DNA Polymerase encoding region. 
This strategy was implemented as follows: 

DNA containing KSHV encoding sequences for Glycoprotein B was prepared from a frozen 
Kaposi's sarcoma sample, designated RiGr, and a cell line derived from a body cavity lymphoma, 
10 designated BC-1. 

In order to obtain the full 5* sequence, a Type 1 oligonucleotide probe was designed for the 
encoding sequence suspected of being upstream of Glycoprotein B: namely, the capsid maturation 
gene (CAPMAT). Known sequences of CAPMAT from other viruses were used to identify a relatively 
conserved region, and design a consensus-degenerate primer designated FENSA to hybridize with 
15 CAPMAT in the sense orientation of Glycoprotein B. A Type 1 oligonucleotide probe was designed for 
the encoding sequence suspected of being downstream of Glycoprotein B: namely, the DNA 
polymerase. These oligonucleotides are listed in Table 9: 



TABLE 9: Additional Type 1 Oligonucleotides used for Detecting, AmplhVtng, or Characterizing 

Herpes Virus Polynucleotides 


Target Cmpsid/Haturatiangene from Herpesviruses, especially fr 


om gamma Herpesviruses 


Designation 


Sequence 
(5 , to3') 


Length 


No. of 
forms 


Orien-tation 


SEQ 
ID: 


FENSAC 


GCCTTTGAGAATTCYAARTAYATHAAR 


27 


48 


sense 


77 


FENSAG 


GGGTTTGAGAATTCYAARTAYATHAAR 


27 


48 


sense 


78 


Target: 


DMA polymerase gene from Herpesviruses, especially from gamnu 


i Herpesviruses 


Designation 


Sequence 


Length 


No. of 
forms 


Orien-tation 


SEQ 
ID: 


CVNVB 


TAAAAGTACAGCTCCTGCCCGAANACRTTN 
ACRCA 


35 


64 


antisense 


79 1 



Amplification was earned out using pairs of sense and antisense primers that covered the entire 
Glycoprotein B encoding region. Fragments obtained include those listed in Table 10. 
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TABLE 10: KSHV Glycoprotein B fragments obtained 




Fragment 


Length 


Position 


1 


NJVPA^TVNCB 


0.39 kb 


original fragment 


2 


FENSA^WVNB 


0.9 kb 


5' of fragment 1 across to CAPMAT 


3 


TVNCA^FREYB 


2.3 kb 


3* of fragment V 


••••4'-.v. 


FAYDA-* FREYB 


0.65 kb 


3* of fragment 1 


• 5 . 


SQPVA^HVLQB 


2.5 kb 


3' of fragment 1 across to DNA polymerase 


6 


FREYA -+ SCGFB 


1.1 kb 


3' of fragment 2 across to DNA polymerase 



The protocol used for amplifying and sequencing was as follows: PCR amplification was carried 
out using the DNA template with the primer pair (e.g., FREYA and SCGFB). 35 cycles were conducted 
of 94°C for 45 sec, 60°C for 45 sec, and 72°C for 45 sec; and then followed by a final extension step at 
5 72°C for 10 min. PCR products of the predicted length were purified on agarose gels using the 
QIAQUICK™ PCR purification kit from Quiagen. Purified PCR products were reamplrfied in a second 
round of amplification. The second round was conducted alternatively in a nested or non-nested 
fashion, in the example given, second-round amplification was conducted using FREYA and SCGFB, 
or with FREYA and HVLQB. Amplification for 35 cycles was conducted at 94°C for 45 sec, 65°C for 45 

1 0 sec, and 72*C for 45 sec; and then followed by a final extension step at 72°C for 60 min. 

The PCR products were ligated into the Novagen PT7 BLUE™ vector, and transformed into 
Novablue competent E. coli. Ligations and transformations were performed using Novagen protocols. 
Colonies were screened by PCR using M13 forward and reverse oligonucleotides. Using the Quiaquick 
plasmid isolation kit, plasmids were isolated from PCR positive colonies that had been grown up 

1 5 overnight in 5 mL LB broth at 37°C. Manual sequencing of the plasmids using M1 3 forward and reverse 
sequencing primers was performed following the USB Sequenase Kit protocol (USB). Automated 
sequencing was performed by ABI methods. 

Additional KSHV-specific Type 3 oligonucleotides were designed as the KSHV sequence 
emerged. Type 3 oligonucleotides were used in various pair combinations or with Type 1 

20 oligonucleotides to PCR amplify, clone, and sequence sections of the KSHV DNA. The Type 3 
oligonucleotides used are listed in Table 11: 
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TABLE 11: Additional Type 3 Oligonucleotides used for Detecting, Amplifying, 
Characterizing Herpes Virus Polynucleotides encoding Glycoprotein B 


or 




Target Gfycopn^B from KSHV 


Deslg- 
nation 


Sequence 
■(5' to 3*) 


Length 


No. of 
forms 




SEQ 
ID: 


Orientation 


GAYTA 


TGTGGAAACGGGAGCGTACAC 


21 




sense 


BO 


rnvcD 
U l Tod 


TCAGACAAGAGTACGTGTCGG 


21 




anti-sense 


81 


AIYGB 


TACAGGTCGACCGTAGATGGC 


21 




anti-sense 


82 


VTECA 


CGCCATTTCCGTGACCGAGTG 


21 




sense 


83 


CEHYB 


TGATGAAGTAGTGTTCGGAGG 


21 




anti-sense 


84 


DLGGB 


GATGCCACCCAGGTCCGCCAC 


21 




anti-sense 


85 


DLGGA 


GTGGCGGACCTGGGTGGCATC 


21 




sense 


86 


RAPPA 


CGTAGATCGCAGGGCACCTCC 


21 




sense 


87 


Target: DHA Polymerase from KSHV 






Desig- 
nation 


Sequence 

^•toy) 


Length 


No. of 
forms 


Orientation 


SEQ 
tD: 


GEVFB 


GTCTCTCCCGCGAATACTTCT 


21 


1 


antisense 


88 


HVLQB 


GAGGGCCTGCTGGAGGACGTG 


21 


1 


antisense 


89 


SCGFB 


CGGTGGAGAAGCCGCAGGATG 


21 


1 


antisense 


90 



Figure 18 is a map showing the location where oligonucleotides hybridize with the KSHV DNA. 
Abbreviations used are as follows: d or h = consensus-degenerate probes that hybridize with 
herpesvirus sequences (Type 1), sq = additional sequencing tail available, g = probes that hybridize 
with gamma herpesviruses (Type 1), f = probes that hybridize with KSHV/RFHV family of herpesviruses 
(Type 2), ks = probes specific for KSHV (Type 3). 

Figure 19 lists a consensus sequence obtained by compiling sequence data from each of the 
characterized fragments. The polynucleotide sequence (SEQ. ID NO:91) is shown. Nucleotides 1-3056 
(SEQ. ID NO:92) incorporating the region before the DNA polymerase encoding sequence is an 
embodiment of this invention. This consensus sequence represents the consensus of data obtained 
from both the Kaposi's sarcoma sample RiGr, and the lymphoma cell line BC-1, with a plurality of 
clones being sequenced for each sample and each gene segment. Between about 3-9 determinations 
have been performed at each location. 

Also shown In Figure 19 is the amino acid translation of the three open reading frames (SEQ. ID 
NOS:93-95). The encoded CAPMAT protein fragment (SEQ. ID NO:93) overlaps the 5* end of the 
Glycoprotein B encoding sequence (SEQ. ID NO:94) in a different phase. Further upstream, the 
CAPMAT encoding sequence is also suspected of comprising control elements for Glycoprotein B 
transcription, due to homology with the binding site for RNA polymerase 2 of Epstein Ban Virus. This 
putative promoter region is underlined in the Figure. At the 3' end of the Glycoprotein B encoding 



76 



WO 97/12042 



PCT/US96/15702 



sequence, there is an untranslated sequence including a polyadenlyation signal. Further downstream is 
the encoding sequence for a DNA Polymerase fragment (SEQ. ID NO:95). 

When the Glycoprotein B encoding sequence was compared with other sequences on GenBank, 
homology was found only with Glycoprotein B sequences from other herpes viruses. Occasional 
5 sequences of 20 nucleotides or less are shared with several herpes viruses. The sequence 
ATGTTCAGGGAGTACAACTACTACAC (SEQ. ID NO:98) is shared with eHV2. Other than this 
sequence, segments of the KSHV encoding region 21 nucleotides or longer are apparently unique, 
compared with other previously known sequences. 

Within the Glycoprotein B encoding sequence, four allelic variants were noted at the 
10 polynucleotide level between sequence data obtained using the Kaposi's sarcoma sample and that 
obtained using the body cavity lymphoma cell line. These are indicated in the Figure by arrows. All but 
one of the variants was silent. The fourth variant causes a difference of Proline to Leucine in the gene 
product 

The protein product encoded by the KSHV Glycoprotein B gene has the following features: 
15 There is a domain at the N-terminus that corresponds to the signal-peptide domain (the 'leader*) of 
Glycoprotein B other herpes viruses. The complete KSHV Glycoprotein B amino acid sequence with 
that known for other herpes viruses is provided in Figure 3, and reveals areas of homology. Residues 
highly conserved amongst herpes virus Glycoprotein B sequences are marked with an asterisk (*). The 
cysteine residues conserved amongst other herpes virus Glycoprotein B sequences are also present in 
20 that of KSHV. In addition, there are two additional cysteines which could form an additional internal 
disulfide and stabilize the three-dimensional structure (marked by "•")■ The KSHV Glycoprotein B 
sequence also has a predicted membrane-spanning domain that corresponds to that on Glycoprotein B 
of other herpes viruses. 

The full glycoprotein B sequence of RFHV is obtained by a similar strategy. The source for 
25 RFHV DNA is similarly prepared tissue from infected monkeys at the University of Washington Regional 
Primate Research Center. DNA is extracted as described in Example 5. 

In order to obtain further sequence data in the 5' direction of the coding strand, amplifications are 
conducted using the consensus-degenerate (Type 1) oligonucleotide hybridizing upstream from the 
fragment as the 5' primer, in combination with the closest virus-specific (Type 3) oligonucleotides as the 
30 3* primers. Thus, a first series of amplification cycles are conducted, for example, using FRFDA and 
AAITB as the first set of primers. This is followed by a second series of amplification cycles, conducted 
the same primers, or using the nested set FRFDA and GMTEB. Amplification conditions are similar to 
those described for KSHV. 

In order to obtain further sequence data in the 3* direction of the coding strand, amplifications are 
35 conducted using consensus-degenerate (Type 1) oligonucleotides hybridizing downstream from the 
fragment as the 3* primer, in combination with the closest virus-specific (Type 3) oligonucleotides as the 
5' primers. Thus, a first series of amplification cycles are conducted using NVFDB and VEGLA, 
followed by a second series conducted using NVFDB and PVLYA. Amplification and sequencing is 
performed as before. The new sequence is used to design further Type 3 oligonucleotides with a sense 
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orientation, which are used with other downstream-hybridizing Type 1 oligonucleotides (namely FREYB 
and NVFDB) to obtain further sequence data. 

Polynucleotide and amino acid sequence data is used to compare the Glycoprotein B of RFHV 
and KSHV with each other, and with that of other herpes viruses. The RFHV and KSHV sequences 
5 may be used to design further subfamily-specific Type 2 oligonucleotides, as in Example 6. 

Example 8: Glycoprotein B sequences from DNA libraries 

Complete Glycoprotein B sequences can be obtained or confirmed by generating DNA libraries 

1 0 from affected tissue. Sources of DNA for this study are the same as for Example 7. 

The DNA lysate is digested with proteinase K, and DNA is extracted using phenol-chloroform. 
After extensive dialysis, the preparation is partially digested with the Sau3A I restriction endonuclease. 
The digest is centrifuged on a sucrose gradient, and fragments of about 10-23 kilobases are recovered. 
The lambda DASH-2™ vector phage (Stratagene) is prepared by cutting with BamHI. The 

1 5 size-selected fragments are then mixed with the vector and ligated using DNA ligase. 

The ligated vector is prepared with the packaging extract from Stratagene according to 
manufacturer's directions. It is used to infect XL1-BLUE™ MRA bacteria. About 200,000 of the 
phage-infected bacteria are plated onto agar at a density of about 20,000 per plate. After culturing, the 
plates are overlaid with nitrocellulose, and the nitrocellulose is cut into fragments. Phage are eluted 

20 from the fragments and their DNA are subjected to an amplification reaction using appropriate 
virus-specific primers. The reaction products are run on an agarose gel, and stained with ethidium 
bromide. Phage are recovered from regions of the plate giving amplified DNA of the expected size. 
The recovered phage are used to infect new XL1 bacteria and re-plated in fresh cultures. The process 
is repeated until single clones are obtained at limiting dilution. 

25 Each clone selected by this procedure is then mapped using restriction nucleases to ascertain 

the size of the fragment incorporated. Inserts sufficiently large to incorporate the entire Glycoprotein B 
sequence are sequenced at both ends using vector-specific primers. Sequences are compared with 
the known polynucleotide sequence of the entire EBV genome to determine whether the fragment 
spans the intact Glycoprotein B sequence. DNA is obtained from suitable clones, sheared, and 

30 sequenced by shot-gun cloning according to standard techniques. 

Example 9: Antigenic regions of Glycoprotein B 

The polynucleotide fragments between the hybridization sites for NIVPA and TVNCB in the 
35 Glycoprotein B gene have the predicted amino acid sequences shown in Figure 14. Based on these 
sequences, peptides that are unique for RFHV or KSHV, or that are shared between species can be 
identified. 

Figure 14 shows example peptides of 6 or 7 amino acids in length. Some of the peptides 
comprise one or more residues that are distinct either for RFHV or KSHV (Class III), or for the 
40 RFHV/KSHV subfamily (Class II) compared with the corresponding gamma herpes virus peptides. 

7* 
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To confirm that regions contained within this 106-amino add region of Glycoprotein B may be 
recognized by antibody, computer analysis was performed to generate Hopp and Woods antigenicity 
plots. The Hopp and Woods determination is based in part on the relative hydrophilicity and 
hydrophobicity of consecutive amino acid residues (Hopp et al). Results are shown in Figures 20, 
5 21 and 22. Key: - = antigenic; A = hydrophobic; # = potential N-linked glycosylate site. Figure 20 
shows the analysis of the 106 amino acid Glycoprotein B fragment from RFHV; Figure 21 shows the 
analysis of the KSHV fragment, and Figure 22 shows the analysis of the full KSHV sequence. 

Both RFHV and KSHV contain several regions predicted to be likely antibody target sites. In 
particular, the KSHV sequence shows an antigenic region near the N-terminal end of this fragment, and 
10 near the potential N-linked glycosylation site. The full-length KSHV sequence shows hydrophobic 
minima corresponding both to the signal peptide (residue -25) and the transmembrane domain (residue 
-750). A number of putative antigenic regions with scores >1.0 or >1.5 are observed. Particularly 
notable is a region scoring up to -2.5 that appears at about residues 440-460. 

15 Example 10: Virus specific Glycoprotein B amplification assays 

Type 3 oligonucleotides are used in nested virus-specific amplification reactions to detect the 
presence of RFHV or KSHV in a panel of tissue samples from potentially affected subjects. 

For KSHV, DNA is extracted from tissue suspected of harboring the virus; particularly biopsy 

20 samples from human subjects with Kaposi's Sarcoma lesions and body cavity B-ceil lymphoma. A 
number of different tissue samples are used, including some from KS lesions, some from apparently 
unaffected tissue in the same individuals, some from HIV positive individuals with no apparent KS 
lesions, and some from HIV negative individuals. Five samples are obtained in each category. DNA is 
prepared as described in Example 2. 

25 The oligonucleotide primers GLTEA, YELPA, VNVNB, and ENTFB are ordered from Oligos Etc., 

Inc. The DNA is amplified in two stages, using primers GLTEA and ENTFB in the first stage, and 
YELPA and VNVNB in the second stage. The conditions of the amplification are similar to those of 
Example 3. The reaction product is electrophoresed on a 2% agarose gel, stained with ethidium 
bromide, and examined under U.V. light. A positive result is indicated by the presence of abundant 

30 polynucleotide in the reaction product, as detected by ethidium bromide staining. This reflects the 
presence of KSHV derived DNA in the sample; specifically, the Glycoprotein B encoding fragment from 
YELPA to VNVNB. Results are matched with patient history and sample histopathology to determine 
whether positive assay results correlate with susceptibility to KS. 

For RFHV, DNA is extracted from frozen tissue samples taken from Macaca nemestrina and 

35 Macaca fascicularis monkeys living in the primate colony at the Washington Regional Primate 
Research Center. Ten samples are taken each from tissue sites showing overt symptoms of 
fibromatosis, apparently unaffected sites in the same monkeys, and corresponding sites in monkeys 
showing no symptomatology. Nested PCR amplification is conducted first using GMTEA and VEGLB, 
then using KYEIA and TDRDB. Amplification product is electrophoresed and stained as before, to 

40 determine whether RFHV polynucleotide is present in the samples. 

Hi 
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Example 11: Immunogenic regions of Glycoprotein B 

To identify what antibodies may be generated during the natural course of infection with KSHV. 
5 serum samples are obtained from 10-20 AIDS subjects with Kaposi's Sarcoma lesions, from 10-20 
HIV-positive symptom-negative subjects, and 10-20 HIV-negative controls. In initial studies, sera in 
each population are pooled for antibody analysis. 

Peptides 12 residues long are synthesized according to the entire predicted extracellular domain 
of the mature KSHV Glycoprotein B molecule. Sequential peptides are prepared covering the entire 

1 0 sequence, and overlapping by 8 residues. The peptides are prepared on a nylon membrane support by 
standard F-Moc chemistry, using a SPOTS™ kit from Genosys according to manufacturer's directions. 
Prepared membranes are overlaid with the serum, washed, and overlaid with beta-galactose 
conjugated anti-human IgG. The test is developed by adding the substrate X-gal. Positive staining 
indicates IgG antibody reactivity in the serum against the corresponding peptide. 

15 Similarly, to identify antibodies formed in the natural course of RFHV infection, blood samples 

are collected from 10 Macaca nemestrina and 10 Macaca fascicularis monkeys, a proportion of which 
display overt symptoms of fibromatosis. The presence or absence of an ongoing RFHV infection is 
confirmed by conducting PCR amplification assays using RFHV-specific oligonucleotides as in Example 
1 0. Plasma and blood cells are separated by centrifugation. These sera are used to test for antibodies 

20 in a method similar to that for KSHV, except that 12-mers are synthesized based on the RFHV 
Glycoprotein B sequence. 

Select RFHV and KSHV peptides are also tested in animal models to determine immunogenic^ 
when administered in combination with desirable adjuvants such as alum and DETOX™. Suitable 
peptides include those identified in the aforementioned experiment as eliciting antibody during the 

25 natural course of viral Infection. Other candidates include those believed to participate in a biological 
function of Glycoprotein B, and those corresponding to peptides of other herpes viruses known to elicit 
viral neutralizing antibodies. The peptides are coupled onto keyhole limpet hemocyanin (KLH) as a 
carrier, combined with adjuvant according to standard protocols, and 100 *ig peptide equivalent in 1-2 
mL inoculum is injected intramuscularly into rabbits. The animals are boosted with a second dose 4 

30 weeks later, and test-bled after a further 2 weeks. 

Microtiter plate wells are prepared for ELISA by coating with the immunogen or unrelated 
peptide-KLH control. The wells are overlaid with serial dilutions of the plasma from the test bleeds, 
washed, and developed using beta-galactose anti-human IgG and X-gal. Positive staining in the test 
wells but not the control wells indicates that the peptide is immunogenic under the conditions used. 

35 

Example 12: Identification and characterization of Glycoprotein B from other members of 
the RFHV/KSHV subfamily 



40 



Tissue samples suspected of containing a previously undescribed gamma herpes vims, 
particularly fibroproliferative conditions, lymphocyte malignancies, and conditions associated with 

$6 
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immunodeficiency and immunosuppression, such as acute respiratory disease syndrome (ARDS), are 
preserved by freezing, and the DNA is extracted as in Example 2. Two rounds of PCR amplification are 
conducted using Type 1 oligonucleotides FRFDA and TVNCB in the first round, then using nested Type 
1 or Type 2 oligonucleotides in the second round. 
5 Optionally, the presence of an RFKV/KSMV family Glycoprotein B polynucleotide is confirmed by 

probing the amplification product with a suitable probe. The amplified polynucleotide is electrophoresed 
in agarose and blotted onto a nylon membrane. The blot is hybridized with a probe comprising the 
polynucleotide fragment obtained from the KSHV polynucleotide encoding Glycoprotein B (residues 36- 
354 of SEQ. ID NO:3), labeled with M P. The hybridization reaction is done under conditions that will 

10 permit a stable complex forming between the probe and Glycoprotein B from a herpes virus, but not 
between the probe and Glycoprotein B encoding polynucleotides from sources outside the RFHV/KSHV 
subfamily. Hybridization conditions will require approximately 70% identity between hybridizing 
segments of the probe and the target for a stable complex to form. These conditions are calculated 
using the formula given earlier, depending on the length and sequence of the probe and the 

15 corresponding sequence of the target. The conditions are estimated to be: a) allowing the probe to 
hybridize with the target in 6 x SSC (0.15 M NaCI, 15 mM sodium citrate buffer) at room temperature in 
the absence of fbrmamide; and b) washing newly formed duplexes for a brief period (5-10 min) in 2 x 
SSC at room temperature. 

Amplified polynucleotides that hybridize to the labeled probe under these conditions are selected 

20 for further characterization. Alternatively, PCR amplification products having about the same size as 
that predicted from the KSHV are suspected of having a related sequence. Samples may also be 
suspected of having a related sequence if they have been used to obtain polynucleotides 
encompassing other regions of a herpes vims genome, such as DNA polymerase. Samples containing 
fragments potentially different from RFHV or KSHV, either due to a size difference or different origin, 

25 are sequenced across the fragment as in Example 4. Those with novel sequences are used to 
determine the entire Glycoprotein B gene sequence by a method similar to that in Example 7 or 8. 

A Glycoprotein B encoding sequence from a third member of the RFHV/KSHV herpes virus 
subfamily was obtained as follows. 

DNA was extracted from two frozen tissue samples from a Macaca mulatta monkey with 

30 retroperitoneal fibromatosis. Extraction was conducted according to Example 1. The extracted DNA 
was precipitated with ethanol in the presence of 40 m glycogen as carrier, washed in 70% ethanol, and 
resuspended in 10 mM Tris buffer, pH 8.0. The extracted DNA was used to obtain a 151 base pair 
fragment of a herpes vims DNA polymerase gene, which was non-identical to that of KSHV. RFHV, or 
any other previously characterized DNA polymerase. This lead to the suspicion that the sample 

35 contained genomic DNA from a different herpes vims, that could be used to identify and characterize a 
new Glycoprotein B gene. 

A 386 base pair fragment of a Glycoprotein B encoding sequence was amplified from the sample 
using a hemt-nested PCR. The procedure was similar to that used In Examples 4 and 5, with a first 
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round of amplification using FRFDA and TVNCB, followed by a second round of amplification using 
NIVPA and TVNCB. The final PCR product was sequenced as before. 

Figure 23 lists the polynucleotide sequence (SEQ. ID NO:96) along with the corresponding 
amino acid translation (SEQ. ID NO:97). Underlined is the 319 base pair sequence in between the two 
5 primer hybridization sites. The sequences are different from those of KSHV and RFHV. The 
Glycoprotein B is from a new member of the RFHV/KSHV subfamily of herpes viruses, designated 
RFHV2. 
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Sequences 



] 



SEQ. 
10 



Designation 



Description 



Type 



Source 



1 


RFHV 


Glycoprotein B PCR segment 


dsDNA 


Figure 1 


| 2 ; : 


RFHV 


Glycoprotein B PCR segment 


Protein 


Figure 1 




KSHV 


GIvcoDFOtein B PCR seoment 


□SUrMA 


Figure 1 




KSHV 


Glycoprotein B PCR segment 


Protein 


Figure 1 




SHV1 


Glycoprotein B sequence 


dsDNA 


GenBank 
HSVSPOLGBP 




bHV4 


Glycoprotein B sequence 


dsDNA 


GenBank 
BHT4GLYB 


7 


eHV2 


Glycoprotein B sequence 


dsDNA 


GenBank , 
EHVU20824 


8 


mHV68 




OSUNA 


GenBank 
MVU08990 


9 


hEBV 


Glycoprotein B sequence 


dsDNA 


GenBank 
EBV 




hCMV 


Glycoprotein B sequence 


dsDNA 


GenBank 
HEHCMVGB 


11 


hHV6 


Glycoprotein B sequence 


dsDNA 


GenBank 
HH6GBXA 


12 


hVZV 


Glycoprotein B sequence 


dsDNA 


GenBank 
ncVtVAA 


13 


HSV1 


Glycoprotein B sequence 


dsDNIA 


wnDariR no IOLTD 


14 


sHV1 


Gtvcoototein B sentience 


Protein 


Translation 


15 


bHV4 


Glycoprotein B sequence 


Protein 


Translation 


16 


eHV2 


Glycoprotein B sequence 


Print Atn 


Translation 


17 


mHV68 


Glycoprotein B sequence 


Protein 


Translation 


18 


hEBV 


Glycoprotein B sequence 


Protein 


Translation 


19 


hCMV 


Glycoprotein B sequence 


Protein 


Translation 


20 


hHV6 


Glycoprotein B sequence 


Protein 


Translation 


21 


hVZV 


Glycoprotein B sequence 


Protein 


Translation 


22 


HSV1 


Glycoprotein B sequence 


Protein 


Translation 


23 


sHVSAS 


Glycoprotein B sequence 


Protein 


Translation 


2440 




TYPE 1 oligonucleotides 
(Gamma herpes Glycoprotein 8) 


ssDNA 
(IUPAC) 


Table 4 


41-47 




TYPE 2 oligonucleotide 
(RFHV/KSHV subfamily Glycoprotein B) 


ssDNA 
(IUPAC) 


Table 6 
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SEQ. 
ID 


Designation 


Description 


Type 


Source 




48-55 




TYPE 3 oligonucleotides - 
RFHV specific Glycoprotein B 


ssDNA 


Table 7 


5*63 




TYPE 3 oligonucleotides - 
KSHV specific Glycoprotein B 


ssDNA 


Table 7 


64« 




CLASS 1 antigen peptides 
(Gamma herpes Glycoprotein 8) 


Protein 


Table 8 


67-72 




CLASS II antigen peptides 
(RFHV/KSHV subfamily Glycoprotein B) 


Protein 


Tabled 


73-74 




CLASS III antigen peptides- 
RFHV specific Glycoprotein B 


Protein 


Table 8 


75-76 




CLASS III antigen peptide s- 
KSHV specific Glycoprotein B 


Protein 


Tabled 


77-78 




TYPE 1 oligonucleotide 
(Gamma herpes Capsid maturation) 


ssDNA 
(IUPAC) 


Table 9 


79 




TYPE 1 oligonucleotide 
(Gamma herpes DAW polymerase) 


ssDNA 
(IUPAC) 


Table 9 






KSHV soeafic GtvooomtBin R 




ToKIa 14 I 
I BDie 11 H 


88-90 




TYPE 3 oligonucleotides — 
KSHV specific DAW Polymerase • 




Tahle 1 1 


91 


KSHV 


DNA sequence comprising encoding regions 
for Capsid Maturation fragment, Glycoprotein 
B, and DNA polymerase fragment 


dsDNA 


Figure 18 


92 


KSHV 


DNA sequence comprising encoding regions 
for Capsid Maturation fragment and 
Glycoprotein B (residues 1-3056) 


dsDNA 


Example 7 


93 


KSHV 


Capsid Maturation sequence 


Protein 


Figure 18 


94 


KSHV 


Glycoprotein B sequence 


Protein 


Figure 18 


95 


KSHV 


DAW polymerase sequence 


Protein 


Figure 18 


96 


RFHV2 


G/KCGp/ote/r>BPCR segment 


dsDNA 


Figure 22 


97 


RFHV2 


Glycoprotein B PCR segment 


Protein 


Figure 22 


h 98 




Shared sequence 


dsDNA 


Example 7 


| 99-100 




CLASS I antigen peptides olGfycoprotein B 


Protein 


Tab*e8 [ 



«7 
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Partial Sequence Listimg 



5 (1) GEHERAL INFORMATION: 

(i) APPLICANT: Rose, Timothy M. 

Bosch. Marnix 
Strand. Kurt 

(11) TITLE OF INVENTION: GLYCOPROTEIN B Of THE RFHV/KSHV 
10 SUBFAMILY OF HERPES VIRUSES 

(111) NUMBER OF SEQUENCES: 40 
(1v) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Morrison & Foerster 

(B) STREET: 755 Page Mill Road 
15 (C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94304 1018 
(V111) ATTORNEY/AGENT INFORMATION: 

20 (A) NAK: Schlff. J. Michael 

(B) REGISTRATION NUMBER: 40.253 

(C) REFERENCE/DOCKET NUMER: 29938-20002.00 
(1x) TELECOHUNICATION INFORMATION: 

(A) TELEPHONE: (415) 813-5600 
25 (B) TELEFAX: (415) 494-0792 

(C) TELEX: 706141 

SEQ ID N0:1: 

CT STACAAG A AGAACATCGT GCCGTACATT TTCAAGGTAC GCAGGTACAT AAAAATAGCA 60 

30 ACATCTCTCA CGGTCTACCG CGGTATGACA GAAGCAGCAA TCACAAACAA ATATGAGATC 120 

CCCAGGCCCG TGCCTCTCTA CGAGATCAGT CACATGGACA GCACCTACCA GTQCTTTAGT 180 

TCCATGAAAA TTGTAGTGAA CGGAGTCGAA AATACGTTCA CCGATCGGSA TGACGTAAAC 240 

AAAACCGTAT TTCTCCAGCC CGTCGAAQGT CTAACTGACA ACATACAAAG ATACTTTAGC 300 

CAACCACTAC TGTACTCTGA ACCCGGATGG TTCCCAGGTA TCTACAGGST TGGGACAACA 360 

35 GTAAACTGTC AGATTGTAGA CATGTT 386 

SEQ ID N0:2: 

Val Tyr Lys Lys Asn He Val Pro Tyr lie Phe Lys Val Arg Arg Tyr 
He Lys He Ala Thr Ser Val Thr Val Tyr Arg Gly Met Thr Glu Ala 

40 Ala He Thr Asn Lys Tyr Glu He Pro Arg Pro Val Pro Leu Tyr Glu 

He Ser His Met Asp Ser Thr Tyr Gin Cys Phe Ser Ser Met Lys He 
Val Val Asn Gly Val Glu Asn Thr Phe Thr Asp Arg Asp Asp Val Asn 
Lys Thr Val Phe Leu Gin Pro Val Glu Gly Leu Thr Asp Asn He Gin 
Arg Tyr Phe Ser Gin Pro Val Leu Tyr Ser Glu Pro Gly Trp Phe Pro 

45 Gly He Tyr Arg Val Gly Thr Thr Val Asn Cys Glu He Val Asp Met 

SEQ ID N0:3: 

GTGTACAAGA AGAACATCGT GCCGTATATT TTTAAGGTGC GGCGCTATAG GAAAATTGCC 60 
ACCTCTGTCA CGGTCTACAG GGGCTTGACA GAGTCCGCCA TCACCAACAA GTATGAACTC 120 
50 CCGAGACCCG TGCCACTCTA TGASATAAGC CACATGGACA GCACCTATCA GTGCTTTAGT 180 
TCCATGAAGG TAAATGTCAA CGQGGTAGAA AACACATTTA CTGACAGAGA CGATG7TAAC 240 
ACCACASTAT TCCTCCAACC AGTAGAGGGG CTTACGGATA ACATTCAAAG GTACTTTAGC 300 
CAQCCGGTCA TCTACGCGGA ACCCGGCTGG TTTCCCGGCA TATACAGAGT TMGACAACA 360 
GTCAACTCTG AGATTGTAGA CATGTT 386 

55 

SEQ ID N0:4: 

Val Tyr Lys Lys Asn lie Val Pro Tyr He Phe Lys Val Arg Arg Tyr 
Arg Lys He Ala Thr Ser Val Thr Val Tyr Arg Gly Leu Thr Glu Ser 
Ala lie Thr Asn Lys Tyr Glu Leu Pro Arg Pro Val Pro Leu Tyr Glu 
60 He Ser His Met Asp Ser Thr Tyr Gin Cys Phe Ser Ser Met Lys Val 

Asn Val Asn Gly Val Glu Asn Thr Phe Thr Asp Arg Asp Asp Val Asn 
Thr Thr Val Phe Leu Gin Pro Val Glu Gly Leu Thr Asp Asn lie Gin 
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Arg Tyr Phe Ser Gin Pro Val lie Tyr Ala Glu Pro Gly Trp Phe Pro 
Gly He Tyr Arg Val Arg Thr Thr Val Asn Cys Glu He Val Asp Met 

SEQ ID N0:5: 

5 ATQGTACCTA ATAAACACTT ACTGCTTATA ATTTTGTCGT TTTCTACTGC ATGTOGACAA 60 

ACGACACCTA CTACAGCTCT TGAAAAAAAT AAAACTCAAG CTATATACCA AGAGTATTTC 120 

AAATATCGTG TATGTAGTQC ATCAACTACT GGAGAATTGT TTAGATTTGA TTTAGACAGA 180 

ACTTGTCCAA GTACTGAAGA CAAAGTTCAT AAGGAAGGCA TTCT1TTAGT GTACAAAAAA 240 

AATATAGTTC CATATATCTT TAAAGTCAGA AGATACAAAA AAATCACAAC ATCAGTCCGT 300 

10 ATTTTTAATG GCTGGACTAG AGAAGGTCGTT GCTATTACAA ACAAATGQGA ACTTTCTAGA 360 

GCTGTTCCAA AATATGAGAT AGATATTATG GATAAGACTT ACCAATCTCA TAATTQCATG 420 

CAGATAGAAG TAAACGGAAT GTTAAATTCT TACTATGACA GAGA1GGAAA TAACAAAACT 480 

GTAGACTTAA AGCCTGTAGA 7GGTCTAACG GGTGCAATTA CAAGATACAT TAGCCAACCT 540 

AAAGTTTTTG CTGATCCTOG CTOGCTATGG GGAACTTACA GGACTCGAAC TACCGTTAAC 600 

15 TGTGAAATTC TAGACATGTT TBCTAGGTCT GCTGACOCTT ACACATACTT TGTGACTQCG 660 

CTTGBCGACA CAGTAGAAGT GIUCCIIIC TGTGA7GTAG ATAATTCATG CCCAAATGCA 720 

ACTGAOGTGT TGTCAGTACA AATAGACTTA AATCACACTG T7GTTGACTA TGGAAATAGA 780 

GCTACATCAC AGCAGCATAA AAAAAGAATA TTTGCTCATA CTTTAGATTA nC l G TI I l l 840 

TGQGAAGCTG TAAACAAATC CGCGTCAGTA TGCTCAATGG TTTTTTGGAA GAGT7TTCAA 900 

20 CGAGCTATCC AAACTCAACA TCACTTAACT TATCATTTCA TTOCTAATGA AATAACAGCA 960 

GGATTCTCTA CAGTGAAAGA ACCCTTAGCA AATTTTACAA GTGATTACAA TTG TC T TA TG 1020 

ACTCATATCA ACACTACTTT AGAGGATAAG ATAGCAAGAG TCAACAATAC TCACACTCCA 1080 

AATGGTACAG CAGAATATTA TCAAACAGAA GGTGGAATGA TTTTAGTGTG GCAGCCATTA 1140 

ATAGCAATAG AATTAGAAGA AGCAATGTTG GAAGCAACTA CATCTCCAGT AACTCCTAGT 1200 

25 GCACCAACTA GCTCATCTAG AAGTAAGCGA GCAATAAGAA GCATAAGAGA TGTOAGTOCA 1260 

GGTTCAGAAA ATAATCTGTT TCTATCACAA ATACAATATG CATATGATAA GCTACGTCAA 1320 

AGTATCAACA ACGTGCTAGA AGAGTTAGCT ATAACATGGT GTAGAGAACA AGTGAGACAA 1380 

ACAATGGTGT GGTATGAGAT AGCAAAAATT AATCCAACAA GTGTTATGAC AGCAATATAT 1440 

GGAAAACCTG TCTCTCGTAA AGCTTTAGGA GATGTAATCT C1GTTACAGA ATGTATAAAT 1500 

30 GTTGACCAAT CTAGTCTGAG CATACACAAG AGTCTTAAAA CAGAAAATAA TGACATATGC 1560 

TATTCACGGC CTCCAGTTAC ATTTAAATTT GTTAACAGTA GTCAGCTCTT TAAAGGACAG 1620 

TTAGGGGCTA GAAATGAAAT lUIUblLA GAAAGTCTTG TAGAAAATTG CCACCAAAAT 1680 

GCAGAGACTT TTTTTACAGC TAAAAATGAA ACTTACCACT TTAAAAATTA TC7GCATGTA 1740 

GAAACTTTGC CAGTCAATAA CATTTCAACT TTAGACACTT TTTTAGCTCT TAACCTAACT 1800 

35 TTCATAGAAA ATATTGACTT TAAAGCTCTT GAATTGTATT CAAGTGGAGA GAGAAAGTTA 1860 

GCAAACGTGT TTGATTTAGA GACTATGTTT AGAGAATATA ACTATTACGC TCAGAGTATA 1920 

TCTGGCTTAA GAAAAGATTT TGATAACTCT CAAAGAAACA ACAGAGACAG AATCATTCAA 1980 

GATTTTTEAG AAATTCTAGC AGACTTAGGC TCTATCGGCA AAGTTATTGT TAATGTGGCA 2040 

AGOGGCGCAT IllCltllll TGGAGGTATT GTAACAGGCA TATTAAATTT TATTAAAAAT 2100 

40 CCTTTAGGTG GCATGTTCAC ATTTCTATTA ATAGGAGCAG TTATAATOT AGTAATTCTA a60 

CTAGTACGGC GCACAAATAA TATGTCTCAA GCTCCAATTA GAATGATTTA CCCAGATGTT 2220 

GAGAAATCTA AATCTACTGT GACGCCTATG GAGCCTCAAA CAATTAAACA AATTTTGCTT 2280 

GGAA1GCATA ACATGCAGCA AGAAGCATAT AAGAAAAAAG AAGAACAAAG AGCTGCTAGA 2340 

CCGTCTATTT TTAGACAAGC TGCTGAGACA TTTTTGCGTA AGCGATCTGG TTACAAACAG 2400 

45 ATTTCAACCG AAGACAAAAT AGTAT 2425 

SEQ ID NO:6: 

ATGTATTATA AGACTATOT ATTCTTCGCT CTAATTAAGG TATGCAGTTT CAACCAGACC 60 

ACTACACACT CAACCACAAC CTCACCAAGT ATTTCATCAA CCACCTCTTC CACAACAACA 120 

50 TCAACAAGCA AGCCATCAAA CACAACCTCA ACAAATAGTT CATTAGCTQC C7CTCCCCAG 180 

AACACGTCAA CAAGCAAGCC ATCCAC7GAT AATCAGGGTA CCAGTACCCC CACTATTCCA 240 

ACTUTTACTG ATGACACAGC CAGTAAAAAT TTTTATAAAT ACAGAGTATG CAGTGCATCA 300 

TCTTCCTCTG GAGAACTATT CAGATTTGAC CTTGATCAGA CATGTCCAGA TACAAAAGAT 360 

AAAAAACATG TOGAAGQCAT CCTGCTOGTA CTAAAAAAGA ATATTGTCCC ATACATCTTC 420 

55 AAAGTGAGGA AATATAGAAA AATTGOCACC TCAGTGACAG TTTACABAGG GTGGTCCCAG 480 

GCAGCTCTTA CCAATAGGGA TGATATCAGC AGAGCCATAC CCTATAATGA AATTTCAATG 540 

ATAGATAGGA CCTATCATTG TT7CTCTQCT ATGGCAACAG TCATTAATGG GATTCTGAAC 600 

ACCTATATAG ACAGGGATTC TCAAAATAAG ItlGHUXC TCCAGCCAGT GGCCGGACTG 660 

AOGAGAACA TAAACAGATA CTTTAGTCAA CCTCTCATAT ATGCAGAACC TGGC'F G GTl 1 720 

60 CCAGGGATTT ATAGAGTSA6 AACAACTGTT AATTGTBAGG TTCTTGACAT GTATGCCCGC 780 

TCTCTOGAAC CATATACTCA CTTTATTACA GCTCT9GGBG ACACTATTGA AATCTCCCCA 640 

TTtrUICACA ACAATTCTCA ATQCACCACT GGTAATTCCA CCTCAAGGGA TGCCACAAAG 900 

GTATOGATAG AAGAAAATCA CCAAACTGTT GACTATGAAA GACGGGQGCA TCGCACTAAA 960 
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GATAAAAGAA TCTTICIAAA AGATGAGGAA 
GAGAGAGCTA TTTGTGA1TT TGTCATATQG 
CATAATCAGA GCTTTCACTT TGTOGCAAAT 
CAAGAAGAAA CGGAGCTACG TQGAAATACC 
5 AATGAAACTC TAGAAGAGAC AGTCAAAAAA 
GTAAAGTACT ATAAAACAAA TGGGGGACTA 
AATCPEJTCAG AACACACAAA CTACACTATT 
AGACAAAAAA GGTC7GTAGA TACAAAGACC 
CAGGTTCAAT ATCCCTATGA CCATTTAAGA 

10 ACCAAAACAT GGfTGCCGGGA ACAAAAAAAG 
ATTAACCCAG TGAGTGTCAT GGCAGCCATT 
GGAGATQCAT TCATQGTTTC TGAGTOCATC 
AAAAGTATCA GAACGGATGA TCCCAAGGTA 
1 1 ilillaAATA GTACTCCCAC CTTCAQGQGT 

15 ACAAACACAC ACGTGGAAAC TTGTAGACCA 
ATGACACACT ATTTTAAGGA CTATAAATTT 
ACCCTGGATA CATTTTTAAC TCTCAATTTA 
GTOGAACTIT ACAGTGAGAC 75AAAGAAAG 
TTTAGAGAGT ATAATTACTA CACACAGAAG 

20 ACCATTGACC TGAACAGGGA CAGACTAGTT 
GGAGACATTG GAAAAGTGGT GSTCAACACA 
ATAGTTGGTC GATTTGTCAG TTTTTTCACA 
CTTUTCATAG I IG I GUI III I G II G I M il 
GAGGCCCCCA TAAAAATGAT CTATCCAAAC 

25 CAGCCCCTAC CCGGAGAGGA GATTAAGCGC 
AGTGAGCACG GCAAATCTGA GGAAGAGGCT 
GGGGATGGCC TACAATTGCT GCGCAGGCGC 
AGTCCAGGCA ATGACACATC TGAGACACAC 

30 SEQ ID N0:7: 

ATBGGBSTCG QQQ9CG0GCC TCQCGTCGTC 
TBCCAQQQBG TOSCQCAAGA AGfl I G l GaC I 
CCAGAAGTCSG 7GGCCGAGSA GAACCCGGCC 
GCCTCGCCTA CQGGCGSAGA GATATTCAGG 

35 GAAGACAAGG ACCACATAGA GQQCATAGCT 
GTTTTTAATG TCAGAAAGTA TAGGAAGATC 
AGCGAGGATG CCATAACAAA CCAOACACG 
CAGATGATGG ACCACTATTA TCAGTGCTTT 
GTCAACACCT ACTATCACAG GGACGGGTGG 

40 GGTCTCXCCT CTAGCATAAC GOGCTATCAG 

AAccnnrcr ggtcttacac aacaagaacc 

GCGASATCCA TGAAACCATT TC Ki ll CUT 
TCGCCCTTTT TAAAAGAAAA TQGCACAGAG 
ATTCAACTQC TGAAAAACTA TXTGTCACA 

45 AACGCTACCA GAT7CTTTOC AATATTTGGG 
GAAAACAQCT CCTACTGTCA TTTAAT7TTA 
CAACACAATA GCAGTCTCCA TTTTATTGCC 
TTAGAAGAAG AGGCTAATTT TAACGAGACA 
GAAAT7CAAA AAAAGTTAAA AGAGGTTGAA 

50 GTCTATAAAA CAACAGGCAA TCTGTACATT 
CTASATACTC ATOCCAAGCT GTACAATCTC 
CCCACAACAT CTCCCAGGAG AAGAGGCAG6 
AATGGAGACA ACTCAACTAA GGAAGAGAGT 
GACAATCTCA GAAAGAGCAT CAACAGSSTG 

55 GAACAGTACA GGGCCTCGCT CATGTGGTAC 
ATGAGCGCCA TCTA7GGCAG GCCAGTGTCT 
TCAGATTCTA TCASTCTTSA CCAAAAGAGC 
GGCAAAGAAG ACCTGTGTTA CACCAGGCCT 
GAA C T G TI I G CTGGCCAGCT GQSTCCCAGS 

60 GAGGTCTGCC AGCACAGCTG CGAGCACTAC 
AAGGACTACT ACTATGTCAG TACCCTCAAC 
ATTACCCTGA AC ClGlCrCT GGTAGAGAAT 
AAAACAGAGA AAAQGCTCTC CAACGTCTTT 



TATACCATCT CCTGGAAAGC AGAAGATAGA 1020 

AAAACCTTTC CCAGGQCCAT ACAAACAATC 1080 

GAAGTCACAG CCAGCTTTTT AACATCCAAC 1140 

GAGATATTGA ATTCCATGAA TAGTACCATA 1200 

TTTAACAAAT CCCATATCAG AGATGGGGAG 1260 

TTCCTTATCT GBCAGGCAAT GAAACCCCTT 1320 

GAAAGGAATA ACAAGAOGG AAATAAATCA 1380 

TTCCAAGGCG CCAAGGGCCT GTCCACTGCC 1440 

ACAAGCATGA ATCACATOT AGAGGAATTA 1500 

GACAATCTAA TCTGSTATGA GCTGAGTAAA 1560 

TA7GGGAAAC CTGTOGCAGT GAAAGOCATG 1620 

AATGTTGACC AGGCAAGTGT CAATATCCAT 1680 

TGTTACTCCA GACCCCTGGT CACATTTAAA 1740 

CAGCTTOGAA CAAGGAATGA AATOTOCTC 1800 

ACAGCAGATC ATTATTTTTT TCTAAAGAAC 1860 

GTGAAGACAA TGGATACCAA TAACATATCC 1920 

ACTTTTATA6 ACAATATAGA TTTCAAGACA 1980 

ATGGCCAGTG CCCTCGACCT GGAGACGATG 2040 

CTTOCAAGTC TGAGAGAAGA TCTAGACAAC 2100 

AAAGATCTCT CTGAAATGAT GGCAGACCTT 2160 

TTCAGTOGCA TTCTCACTGT TTTTGGGTCT 2220 

AACCCCATTG GGGGCGTGAC GATCATCCTC 2280 

ATAGTCTCCA GGAGAACCAA TAACA7GAAC 2340 

ATTGACAAAG CCTCTGAGCA GGAGAACATT 2400 

ATOTCCTTC GAATGCACCA GCTCCAGCAA 2460 

AGCCATAAAC CAGGGXTCTT CCAACTATTG 2520 

GGGTATACTA GCTTACCAAC TTTTGACCCC 2580 

CAAAAATATG TTT 2623 



CTCTGTCTAT GGTGCGTUj C TSCG C TT C TC 60 

GAAACGACCA CCCCGTTCGC AACCCACAGA 120 

AACCCCTTTC TGCCGTTCAG GGTATGCGGG 180 

TTCCCCCTGG AGGAGAGCTG CCCCAACACG 240 

CTCATCTACA AGACCAACAT AGTBCCTTAT 300 

ATGACCTCGA CCACCATCTA CAAGGGTTGG 360 

AGGAGCTACG CCGTCCC CC T GTACGAGGTC 420 

AGOGCCGTAC AGGTGAACGA GGGGGGGCAC 480 

AACGAGACCG CCTTCCTCAA ACCGGCOGAT 540 

AGTCAACCAG AGGTCTACGC CACCCCCAGA 600 

ACAGTCAACT GOGAGGTGAC AGAGATCTCT 660 

GTGACGTCTG TTGGTGACAC TATAGAGATG 720 

CCAGAGAAAA TCTTGAAAAG ACCACACTCT 780 

AAGTACGG7G TQQGCT7GGG GCAGGCTGAT 840 

GACTATTCCC TGTCTTGGAA AGCCACCACT 900 

1GGAAGGGGT TTTCCAA7GC CATTCAAACT 960 

AATSATATAA CAGCCTCCTT CTCTACTCCT 1020 

TTTAAGTGTA TAT9GAACAA CACCCAAGAA 1080 

AAAACTCACA GACCTAACGG TACTGOGAAG 1140 

CTTTQGCAAC CGCTTATACA GATAGACCTG 1200 

ACAAACGCTA CAGCTICACC TACATCAACA 1260 

GATACTTCAA GTGTTAGTGG CGGfflQGAAAT 1320 

GTOGCGGCCT CCCAGGTTCA GTTTOCCTAT 1380 

TTGQGAGAGC TGTCCAGGGC ATGGTOCAGG 1440 

GAGCTCAGCA AGATCAACCC CACCAGOGTt 1500 

GCCAASFTCA TAGGGGACGT GGTGFCAGTG 1560 

GTGTTTGraC ACAAAAATAT GAAGGTGCCT 1620 

GTGGTCGXT TCAAGTTTAT CAATGQGAGC 1680 

AACGAGATTG TGC7GTCCAC CTCTCAQGTG 1740 

TTCCAGGCCG GGAACCAGAT GTACAAGTAC 1800 

CTGACTGACA TACCCACCa ACACACCATG 1860 

ATAGATTTTA AGGTGATTGA GCTCTATTCT 1920 

GACATCGAGA CCATGTTCAG GGAGTACAAC 1980 

40 
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TACTACACTC AGAACCTCAA CGGQCTCAGG AAGGACC1GG ATGACAGCAT AGATCATGGC 2040 

AGGGACAGCT TCATCCAGAC CCTGQGTCAC ATCATGCAGG ACCTGGGCAC CATAGGCAAG 2100 

6TGGTOGTCA ATCTGGCCAG CGGAGTGTTt TCCCTCTTTG GGAGCATAGT C7CQQQQGTC 2160 

ATAAGCTTTT TCAAAAATCC Cl H UU m ATQCTBCTCA TAGTCCTCAT CATAGCCQQG 2220 

5 GTAGTGGTGG TCTACCTGTT TA7GACCAGG TCCAGGAGCA TATACTCTGC CCCCATTAGA 2280 

ATQCTCTACC CGGGGGTGGA GAGGGCGGGC CAGGAGCGGG GCGCGCACCC GGTC7CAGAA 2340 

GACCAAATCA GGAACATCCT GA7GGGAATG CACCAATTTC AQCAGCQQCA GCGGGCGGAA 2400 

GAGGAGGCCC GACGAGAGGA AGAAGTAAAA GGAAAAAGAA CTCTCTHU A AGTGATAAGA 2460 

GACTCTGCGA CCAGCGT7CT GAGGAGGAGA AGAGGGGGTC GTOGGTACCA GCGCCTACAG 2520 

10 CGAGACGGGA GCGACGATGA GGGGGATTAT GAGCCATTGA GGCGACAAGA 7GGAGQCTAC 2580 

GACGACGTOG ACGTGGAGGC AGGCAOGGCG GATACCGGTG TGTAA 2625 

SEQ ID N0:8: 

ATGTACCCTA CAGTGAAAAG TATGAGAGTC GCCCACCTAA CCAATCTCCT AACCCTTCTG 60 

15 IG1CTGCTGI GCCACACGCA TCTCTACGTA TG1CAGCCAA CCACTCTGAG GCAGCCATCA 120 

GACA1GACCC CAGCCCAGGA CGCTCCAACA GAGACTCCCC CACCCCTCTC AACTAACACT 180 

AACAGAGGAT TTGAGTACTT TCGCGTGTGT GGGGTGQCTC CCACGGGGGA GACCTTCAGG 240 

TTTGATTTAG ACAAAACATG CCCCAGTACA CAAGATAAGA AGCATGTGGA GGGCATCTTC 300 

CTCGTCTATA AGATCAACAT CGTGCCCTAC ATCTTCAAAA TCAGGAGATA TAGAAAAATA 360 

20 ATTACTCAAC TGACCATCTG GCGAGGCCTA ACCACTAGTT CAGTCACTGG TAAATTTGAA 420 

ATQGCCACTC AGGCCCACGA GTGGGAAGTG GGCGACTTTG ACAGCATCTA TCASTGCTAC 480 

AATAGCGCCA CCATGGTOGT AAACAACGTC AGACAGGTGT ATGIGGACAG AGATGQGGTC 540 

AATAAAACTG TGAACATACG (XC I (il l GAT GGTCTAACAG GGAATATCCA AAGATACTTT 600 

AGTCAGCCCA CCCTTTATTC AGAACCTGGT TBGATGCCTG GCTTTTATCG TGTTCGAACC 660 

25 ACCGJTAACT GTGAAATTGT AGACATGGTG GCACGCTCCA TGGATDCCTA TAACTACATC 720 

GCTACCGCCC TGGGAGACAG CCTGGAGCTC TCCCCGTTTC AAACCTTTGA CAACACCAGC 780 

CAGTGTACTC CGCCTAAGAG AGCTGATATG AGGGTCAGGG AGGTtAAGAA TTACAAGTTT 840 

GTAGATTATA ATAACAGGGG AACTGCCCCC GCTGGACAAA GCAGGACCTT TCTAGAGACT 900 

CCCTCTOCCA CTTACTCCTC GAAAACAGCC ACCAGACAAA CTGCCACGTG CGACCTGGTG 960 

30 CACTGGAAAA CATTCCCTCG CGCCATCCAA ACTGCTCATC AACATAGCTA CCATT7TGTC 1020 

GCCAATGAAG TCACCGCCAC CTTCAATACA CCCCTGACTG AGGTAGAAAA TTTCACCAGC 1080 

ACGTATAQCT GCGTCAGTGA CCAGATCAAT AAGACCATCT CTGAATATAT CCAAAAGTTG 1140 

AACAACTCCT ACGTGGCCAG 7GGGAAAACA CAGTATTTCA AGACTGATOG TAACCTCTAC 1200 

CTCATCTOGC AACCACTCGA ACATCCAGAG ATTGAAGACA TAGAOGAGGA CAGCGACCCA 1260 

35 GAACCAACCC CCGCCCCACC AAAGTCCACA AGGAGAAAAA GAGAGGCAGC TGACAATGGA 1320 

AACTCAACAT CTGAGGTCTC AAAGGGCTCA GAAAATCCGC TCATTACGQC CCAAATTCAA 1380 

TTTGCCTATG ACAAGCTGAC CACCAGCGTC AACAAGGTGC TTGAGGAGTT GFCCAGGGOG 1440 

TOCTGTAGAG AACAGGTCAG AGACACCCTC ATCTGGTATC AGCTTAGCAA GGTGAACCCT 1500 

ACGAG7GTCA TGTCTOCCAT TTATGGAAAG ILIbl lti Uti CCAGGTACGT GGGOGAOGCC 1560 

40 ATATCTGTCA CAGACTGTAT CTATC7GGAC CAAAGTTCAG TCAACATCCA CCAGAGCTTG 1620 

CGGCTGCAGC ATGATAAAAC CACCTGCTAC TCGAGACCTA GAGTCACCTT CAAATTTATA 1680 

AACAGTACAG ACCCGCTAAC TGGCCAGTTG GGTCCTAGAA AAGAAATTAT CCTCTCCAAC 1740 

ACAAACATAG AAACA1GCAA GGATGAGAGT GAACACTACT TCATTCTGGG QGAATACATT 1800 

TACTATTATA AAAATTACAT TTT7GAAGAA AAGCTAAACC TCTCAAGCAT CGCTACCCTA 1860 

45 GACACATTTA TAGCCCTCAA TATCTCAT7T ATTGAAAATA TCGACTTCAA AACAGTAGAA 1920 

CTCTACTCCT CTACTGAAAG GAAACTCGCA TCGAGCGTCT TTSATATAGA ATCCATGTTT 1980 

AGGGAATATA ACTATTACAC CTACAGCCTC GCGGGCATTA AGAAGGACCT AGACAACACC 2040 

A1GGACTACA ATAGAGACAG ACTGGTTCAG GACCTGTCAG ACATGATGGC TGATCTGGGA 2100 

GACATTGGAA GATC7GTGGT GAATGTOGTC AGCTCGGTAG TCACATTT7T CACTASTATT 2160 

50 GTGACAGGGT TCATTAAATT OTTACCAAC CCTCTAGGGG GAATATTCAT TCTCCTAATT 2220 

ATPQGTGGAA TAATCTTCTT GGTGGTAGTC CTAAATAGAA GAAAOCACA GTTTCACGAT 2280 

GCACCCATCA AAATGCTGTA CCCTTCTCTT GAAAACTACG CTGCCAGACA GGCGCCACCT 2340 

CCCTATAGCG CATCACCTCC AGCTATAGAC AAAGAGGAAA TTAAGCGCAT ACTTTTCGGC 2400 

ATGCATCAGG TACACCAGGA AGAAAAGGAA GCACAGAAAC AACTAACCAA CT l l G G OU 2460 

55 ACTTTGTBQC AGAAAGCCAC AG6ATTCCTT AGAAATCGCC GGAAGGGATA CAGCCAACTT 2520 

CCTOQGAAG ATCAATCAAC T TCCCTCT 2548 

SEQ ID N0:9: 

ATGACTCGGC GTAGGGTOCT AAGCGTGGTC GTGCTGCTAG CCGCCCTGGC GTGCCGT t I C 60 

60 GGTGCGCAGA CCCCAGAGCA GCCCGCACCC CCCGCCACCA CGGTOCAGCC TACCGCCACG 120 

CGTCAQCAAA CCAGCTTTCC TTTCGGAGTC TGCGAGCTCT CCAGCCACGG CGA CLHa l I C 180 

CQLI IL IIU CGGACATCCA GIGIttlll G TTTGGCACGC GGGAGAATCA CACGGAGGGC 240 

CTGTTGATGG TGTTTAAAGA CAACATTATT CCCTAC7CGT TTAAQGTCCG CTIXTACACC 300 
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AAGATAGTGA CCAACATTCT CATCTACMT QQCTGGTACG CGSACTCCGT GACCAACCQG 360 

CACGAGGAGA AGTTCTCCGT TGACAQCTAC GAAACTGACC AGATGGATAC CATCTACCAG 420 

TXTACAACG CGGTCAAGAT GACAAAAGAT G9GCTGACGC QCGTUTATGT AGACCGCGAC 480 

GGAGTTAACA TCACCGTCAA CCTAAAGOCC ACCGQGQGCC TQQCCAACQG QGTQCQCCGC 540 

5 TACGCCAGCC AGACQGAGCT CTATCACQCC CCCGQGTGGT TGATATBGAC TTACttAACA 600 

AGAACTACC G TCAACTCCCT GATAAC7GAC ATGATGGDCA AGTCCAACAG CCCCTTCGAC 660 

TTCTTTCTGA CCACCACCGG QCAG/OGTO GAAATCTGCC CTTTCTATGA CGGGAAAAAT 720 

AAGGAAACCT TCCATGAGCG QGCASACTCC TTCCACGTCA GAACTAACTA CAAGATAGTG 780 

GACTACGACA ACCGAGGGAC GAACCCGCAA GQCGAACQCC GAGGCTTCCT GGACAAGGGC 840 

10 ACTTACACGC TATCTTQGAA GCTCGAGAAC AQGACAQCCT AOQCCCGCT TCAACACTGG 900 

CAAACCTTTG ACTCGACCAT CQCCAOGAA ACAGGGAAGT CAATACATTT TGTGACTCAC 960 

GAGQGCACCT CTAQCTTCGT GACCAACACA ACCGTOBBLA TAGAQCTCCC QSACGCCT7C 1020 

AAGTCCATCG AAGAGCAGGT GAACAAGACC ATOCATCAGA AGJACGAGGC CGTCCAGGAT 1080 

CGTTACAOGA AGGGCCAGGA AGCCATTACA TATTTTATAA CGAGCGGAGG ATTC7TATTA 1140 

15 QCTTQQCTAC CTCTGACCCC QCQCTCGTTS GCCACCGTCA AGAACCTGAC GGAGCTTACC 1200 

ACTCCGACTT CCTCACCCCC CAGCAGltCA TCQCCCCCAG CCCCATEXQC GGCCCGCGGG 1260 

AGCACCCCCG CCQCCGTTCT GAGGCGTCGG AQQCQQGATG CGGGGAACGC CACCACACCG 1320 

GTGCCCCCCA CQQCCCCCGG GAAGTCCCTG GQCACCCTCA ACAA7CCCGC CACCGTCCAG 1380 

ATCCAATTTG CCTACGACTC CCTOCQCCQC CAGATCAACC GCATQCTQQG AGACCTTOCG 1440 

20 CGGGCCTQGT GCCTGGAGCA GAAGAGGCAG AACA7QGTOC TGAGAGAACT AACCAAGATT 1500 

AATCCAACCA CC6TCATGTC CAGCATCTAC GGTAAGGCGG 7GQCGGCCAA GCGCC TQQQG 1560 

GATGTCATCT CASTCTCCCA GTGCGTQCCC GTTAAGCAGG CCACCGTCAC CCTGCGCAAG 1620 

AGCATGAGQG TCCCTGGCTC CGAGACCATG TQCTACTCGC GCCCCCTOST GTCCTTCAGC 1680 

TTTATCAACG ACACCAAGAC CTACGAGQGA CAGC7GQBCA CCGACAACGA GA7CTTCCTC 1740 

25 ACAAAAAAGA TCACGGAGGT GTOCCAQQCG ACCAG0CAGT ACTACTTCCA GTCCGQCAAC 1800 

GAGATCCACG TCTACAACGA CTACCACCAC TTTAAAACCA TCGAGCTQGA CGQCATTQCC 1860 

ACCCTQCAGA CCTTCATCTC ACTAAACACC TCCCTCATCG AGAACATTGA OTTSCCTCC 1920 

CTGGAGCTCT ACTCACGGGA CGAACAGCGT GCCTCCAACG 7CTTTCACCT G5AGGGCATC 1980 

TTCCGGSAGT ACAACTTCCA GQCGCAAAAC ATCGCCGQCC 7GCGGAAGGA TTTQGACAAT 2040 

30 GCAGTCTCAA ACGGAAGAAA TCAATTCGTC GACGGCCTGG GQGAACTTAT GGACAGTCTG 2100 

GGTAGCGTOG GTCACTCCAT CACCAACCTA GTCAGCACGG TOQGGSGT 1 1 GTTTAXAGC 2160 

CTGGTCTC1G GTTTCATCTC CTTCTTCAAA AACCCCTTCG GCGGCATQCT CATOQCTC 2220 

CTOG7GQCGG GGGTOGTCAT CCTGGTTATT TOCTCACGA GQCGCACGCG CCAGATCTCG 2280 

CAGCAGCCGG TGCAGATGCT CTACCCCGGG ATtGACGAGC TCGCTCAGCA ACATGCCTCT 2340 

35 GSTGAGQGTC CAGGCATTAA TCCCATTAGT AAGACAGAAT TACAAGCCAT CATCTTAGCG 2400 

CTGCATCAGC AAAACCAGGA GCAAAAGAGA GCAGCTCAGA GQQCGQCCGG ACCCTCAGTG 2460 

GCCAGCAGAG CATTGCAGGC AGCCAGGGAC CGTTTTCCAG GCCTACGCAG AAGA0GCTAT 2520 

CACGATCCAG AGACCGCCGC CGCACTGCTT GGGGAGGCAG AGACTGAGTT TT 2572 

40 SEQ ID NO: 10: 

ATQGAATCCA GGA7CTOGTO CCTQGTAGTC TOCGTTAACC TGTCTATCGT UblUbiai 60 

strracGgrn cctcttctag tacttcccat GCAACTTCTT CTACTCACAA TGGAAGCCAT 120 

ACTTCTCGTA CGACGTCTGC TGAAACCGGG TCAGTCTATT CTCAACACGT AACGTCTTCT 180 

GAAGCCGTCA GTCATAGAGC CAACGAGACT ATCTACAACA CTACCCTCAA GTACGGAGAT 240 

45 GTOGTOGGAG TCAACACTAC CAAGTACCCC TATCGCGTGT GTTCTATGGC CCAGGSTACG 300 

GA7CTTATTC GCTTTGAACG TAATATCA7C TGCACCTCGA TGAAGCCTAT CAATGAAGAC 360 

TTGGATGAGG GCATCATQGT GGTCTACAAG CGCAACATCG 7GGC8CACAC CTTTAAGGTA 420 

CGGGTCTACC AAAAGGTTTT GACGTTTCGT CGTAGCTACG CTTACATCTA CACCACTTAT 480 

CTOCTGGGCA GCAATACGGA ATACGFSGCG CCTCCTATST GQGAGATTCA TCACA1CAAC 540 

50 AAGTTTGCTC AA7GCTACAG TTCCTACAQC GGCGTTATAG GAGGCACGGT TTTCGTQQCA 600 

TATCATAGQG ACACTTATGA AAACAAAACC A7GCAATTAA TTCCCGACGA TTATTCCAAC 660 

ACCCACAGTA CCCGTTACGT GACGGTCAAG GATCAGTQGC ACAGCCGOGG CAGCACCTSG 720 

CTCTATDGTS A GACCTGTA A TCIGAACTCT ATQCTGACCA TCACTACTGC GOGCTCCAAG 780 

TATCCTTATC AM 1 1 1 1 IOC AACT7CCACG GGTCATGTOG TTTACATTTC TCCTTTCTAC 840 

55 AACGGAACCA ATCGCAATQC CAGCTACTTT GGAGAAAAOG CCGACAAGTT TTTCATTTTC 900 

CCGAACTACA CCATCGTTTC CGACTTTGGA AGACCCAA0G CTGOGCCAGA AACCCATAGG 960 

TTOGTGGCTT TTCTCGAACG TGCCGACTCG GTGATCTCTT GGGATATACA GGACGAGAAG 1020 

AATGTCACCT GCCAGCTCAC CTTCTOGGAA GCCTCGGAAC GTACTATCCG TTCCGAAGCC 1080 

GAAGACTCGT ACCACTTTTC TTCTOCCAAA ATGACTOCAA U I IIUU I l TAAGAAACAA 1140 

60 GAAGTGAACA TGTCCGACTC CG0GC7GGAC TGCGTACGTG ATGAGGCTAT AAATAAGTTA 1200 

CAGCAGATTT TCAATACTTC ATACAATCAA ACATATGAAA AATACGGAAA CG T G T ICG I C 1260 

T7CGAAACCA UUjljUjblll Ubl Uil b l lt TQXAAGGCA TCAAGCAAAA ATCTTTGGTG 1320 

GAATTGGAAC GTTTGGCCAA TCGATCCAGT CTGAATATCA CTCATAGGAC CAGAAGAAGT 1380 
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ACGAGTGACA ATAATACAAC TCATTTGTCC 
GCCCAGC7BC AGTTCACCTA TGACACGTTG 
ATCQCAGAAG CCTQGTBTGT GGATCAACGG 
AAGATCAACC CGTCAHXAT TCTCTCGGCC 
5 ATGGGTGATG ICnGQGCCI GGCCAGCTOC 
CTGCGTGATA TCAACGTCAA GGAATCGCCA 
TTTAATTTCG CCAACAGCTC GTACGTGCAG 
CTGTTGGGCA ACCACCGCAC TCAQGAATCT 
GGGAACTCGG CCTACGAGTA CGTGGACTAC 

10 ATCTCCACCG 7CGACAGCAT GATCGCCCTG 
AGGGTACTGG AACTTTACTC GCAGAAAGAG 
GAGATCATGC GCGAATTCAA CTCGTACAAG 
GTCGACCGGC TACCQCCCTA CCTCAAGGGT 
GCGGGAAAGG CGGTTGGCGT AQCCATTQGG 

15 GAAGGCGTTG CCACCTTCCT CAAAAACCOC 
ATAGOOGTAG TCATTATCAC TTATTTGATC 
CCGCTGCAGA ACCTCTTTCC CTATCTGGTG 
AGCACCAAAG ACACGTCGTT AC AGQCTCCG 
GGTCGCAAAG GACGGGGACC ACCGTCGTCT 

20 AACGAGCAGG CTTACCAGAT GCTTCTGGCC 
CAGCAGAACG GTACAGATTC TTTGGACGGA 
CCTAACCTQC TAGACCGGCT GCGACATCGC 
GACGAAGAAG AGAACGTCTG AA 

25 SEQ ID N0:11: 

ATGAGCAAGA TGAGAGTATT ATTCCTGGCT 
TATTGCGATT CGGATGATTA TATCAGAGCG 
TCTTCGATTC CCAAW3GCAC TGATTTGAJG 
TATAAGTCTA ATOCAAAGAT GTCGGAGGGT 

30 AOTACACTT TTCCASTGAG AACATATAAA 
GAIGIGGGIG IGUIIIATTT TCTGGATCGG 
GAAGCAAATT TASTTA ATTC TCGTGOGCAG 
GATGGTACGG ItilllAGfTGC CTATCATGAG 
TTTCCTCTGA ATTTCAAGTC TGTTACTAAT 

35 TTTGCAAGGS GTCCTTTGTG GCTCTATTCT 
GAGGCTACGG CTAAGGCGAA ATATCOGTTT 
GTGGAADQGT CTIXGTICI I OGACGGTTCA 
AAATTGACAA TOTOGAAAA CTATACTATG 
GCTACTACGT TAGTAAGGAA GATCGCTTTT 

40 GAAATCAAGG AAGAGAATGA ATCGGTGTGT 
GGGCTTCGAG CGGAGACGGA TGAGACTTAT 
TTCGTCG C CT CCAAGGAGTC TTTAAATCTT 
GAATTTGAGA AGATAATTAC AGATGTCTAT 
AACGGTAGTT ATCAAATTTT TAAGACTACG 

45 GTGCAAAAAT C7CTTATGGT TCTTGAGCAG 
TTGGTGGATG TCAAGTCTAG ACATGATATT 
ACTT7GAAAG ATTATATCAA CGATGCCTTC 
CAAAAACGAA CGATAACGAT GTTGCACGAA 
TC7GAGGTTT ACGGTCGT CC GATATCTGCA 

50 AAATGCATAG AAGTTAATCA ATCATCCGTT 
GGGAAGGGAG TAAGGAGTGA AACGATGTCT 
GTGAAOCCA CGCCTGAGGT TGTCCTTGGT 
GGTGATCATA GGACAGAGGA ATGTGAGATA 
CATGCACACG TGTATACCGA TTATACGCAT 

55 GTATTGGATG CTTITATTAG ACTAAAGATC 
CTTGATTTAT ATTCGCCGGA CGAATTCAGT 
CTTCGTGAAT ATAACTCATA TAAGAflCGCA 
AATACGCCGT CGTATGFCAA TGGGATTAAT 
ACTGGATTGG GCTCGGTTAT AAGTGTTACG 

60 G I Uj I U I C I I TTTTAAAAAA TCCATTCGGG 
GTTCTCGTTA TAATAATTGT (j E j llll Ui ll 
GACATGATCT TTCCTTATBC CACCAATCCG 
ACTCTCGTCA AGACGOCTAG TGTTAAAGAT 



AGCATGGAAT CGGTGCACAA TCTQGTCTAC 1440 

CGCGGTTACA TCAACCGGGC GCTGGOGCAA 1500 

CGCACCCTAG AGGTCTTCAA GGAACTCAGC 1560 

ATTTACAACA AACOGATTGC CGOGDGTTI C 1620 

GTCACCATCA ACCAAACCAG CGTCAAGGTG 1680 

GGACGCTGCT ACTCACGACC CGTOGTCATC 1740 

TACGGTCAAC TGGGCGAGGA CAACGAAATC 1800 

CAGCTTCCCA GCCTCAAGAT CTTCATCGCC 1860 

CTCTTCAAAC GCA7GATTCA CCTCAGCAGT 1920 

GATATCGACC CGCTGGAAAA TACCGACTTC 1980 

CIGU3TTCCA GCAACGTTTT 7GACCTCGAA 2040 

CAGCGQGTAA ASTACGTGGA GGACAAGGTA 2100 

C7GGACGACC 1CATSAGCGG CCTOGQCGCC 2160 

GCOGTGGGTG GCGCGGTOGC CTCGGTGGTC 2220 

TTCGGAGCCT TCACCA7CAT CCTOTGGCC 2280 

TATACTCGAC AGCGGCGTCT 67BCACGCAG 2340 

TCCGCCGACG GGACCACCGT GACGTPGGG C 2400 

CCTTCCTACG AGGAAAGTGT TTATAATTCT 2460 

GATGCATCCA CGGCGSCTCC GCCTTACACC 2520 

CTGGOCCGTC TGGACGCAGA GCAGOGAGGG 2580 

CAGACTGGCA CGCAGGACAA GGGACAGAAG 2640 

AAAAACGGCT ACAGACACTT GAAAGACTCC 2700 

2722 



GTCTTTTTGA TGAATAGTCT TTTAATCATA 60 

GGCTATAATC ACAAATATCC TTTTOGGATT 120 

CGSTTCGACA GAGATATTTC G7GTTCGCCA 180 

TTTTTCATCA TTTACAAAAC AAATATCGAG 240 

AACGAGC7GA CGTTCCAAAC CAGTTACCGT 300 

ACGGTGATGG GTTTGGCCAT GCCGSTGTAC 360 

TGTTATTCAG CCGTAGCGAT AAAACGACCC 420 

GATAATAATA AAAACGAAAC TCTAGAATTA 480 

AAAAGATTTA TCACTACGAA AGAACCCTAC 540 

ACA1GGAGGT CTCTCAATTG TATTCTGAGG 600 

AGTTACTT7G CTTTGACGAC 7GGTCAAATC 660 

AACGGTAAAC ATTTTGCAGA GCOGTTAGAA 720 

ATAGAAGATC TAATGAATGG TATGAATGGG 780 

CTOGAGAAAG GGGATACTTT GHI I dim 840 

ATGCTAAAGC ACTGGACTAC GGTGACTCAC 900 

CACTT7ATTT CTAAGGAGTT GACAGCCGCT 960 

ACCGATCCCA AACAAACGTG TATTAAGAAT 1020 

ATCTCAGATT ATAATGATGA CTACAGCATG 1080 

GGAGATCTGA TTTTGATTTG GCAGCCTCTT 1140 

GGTTCAGTAA AOTACGTAG GAGGOGAGAT 1200 

CTTTA7GTGC AATTACAGTA CCTUTATGAT 1260 

GGGAATTTGG CAGAATCTTG GTOCCTCGAT 1320 

CTTAGTAAGA TCAGTCCATC GAGTATCGTG 1380 

CAGTTGCATG GTGATGTGTT AGCTATCTCG 1440 

CAGCTTTATA AGAGTATGCG GGTCGTCGAT 1500 

TATAATCGQC CCTTOTGAC GTTTAGCTTT 1560 

CAGCTAGGGT TAGATAATGA GATTCTCTTG 1620 

CCTAGTACAA AGATATTTCT ATCTGGAAAT 1680 

ACGAATTCGA CGCCCATAGA AGACATTGAG 1740 

GACCCTCTCG AAAATGCTCA TTTTAAACTA 1800 

ASAGCAAACG TTTTCGATTT AGAGAATATT 1860 

CTATATACTA TAGAAGCTAA AATTOCTACT 1920 

I t ll MIIA C AAGGGCTTGG GQCXATAGGC 1980 

GCAGGAGCAC TTGGGGATAT TGTGQGTGGA 2040 

GGIGGIC1CA TGTTGATT7T AGCGATAGTA 2100 

AGACAAAAAC A TCTB C T1 A G TAAGCCTATT 2160 

GTGACTACTG TGTCCAGTGT TACGGGGACC 2220 

GCTGACGGGG GCACATCTCT TGCGG1TTCG 2280 



93 
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GAAAAAGAGG AGGGTATGGC "PGACGTCAGT GGACAAATAA GTOGTCA7GA ATAT7CACAA 2340 

GAAGATGCTT TAAAAATGCT CAAGGCCATA AAGTCTTTAG ACGAGTCCTA CAGAAGAAAA 2400 

CCTTCGTUTT CTGAGTCTCA TQCCTCAAAA CCTAGTT7GA TAGACAGGAT CAGGTATAGA 2460 

QGTTATAAGA GTGTAAATGT AGAAGAAGCG TCA 2493 

5 

SEQ ID NO : 12: 

ATGTTTCTTA CQGSCGGTTGT GTCUilUU CCAAGCTCGT TTTATGAGAG TTTACAAGTA 60 

GAGCCCACAC AATCAGAAGA TATAACCCGG TCTQCTCATC TGGGCGATGG TGATGAAATC 120 

AGAGAAQCTA TACACAAGTC CCAGGACGCC GAAACAAAAC CCACGXTTTA CGTCTQCCCA 180 

10 CCGCCAACAG GCTCCACAAT CGTACGATTA GAACCAACTC GGACATCTCC GGATTATCAC 240 

CTTOGTAAAA ACTTTACAGA GQSTATTOCT GIIUIIIATA AASAAAACAT TCCAGCGTAC 300 

AAGTTTAAGG CGACGGTATA TTACAAAGAT GTTATCGTTA GCACQGC6TG GGCCGGAAGT 360 

TCTTATACGC AAATTACTAA TAGATA7BCG GATAGGGTAC CAATTCCCGT TTCAGAGATC 420 

ACGGACACCA TTCATAAGTT TOGCAAGTTGT TUTTCTAAAG CAACGTAC6T ACGAAATAAC 480 

15 CACAAAGTTC AAGCCTTTAA TGAGGATAAA AATCCACAGG ATATQCCTCT AATCGCATCA 540 

AAATATAA1T CTGTOQGATC CAAAGCATT5G CATACTACCA ATGACACGTA CATQGTTQCC 600 

GGAACCCCCG GAACATATAG GACGGGCACG TCGGTCAATT GCATCATTGA GGAAGTTGAA 660 

GCCAGATCAA TATTCCCTTA TGATAGTTTT GGACTTTCCA CGGGAGATAT AATATACATG 720 

TCCCCGTTTT TTQGCCTACG GGATQGTOCA TACAGAGAAC ATTCCAATTA TGCAATGGAT 780 

20 CGTTTTCACC AGTTTGAGGG TTATAGACAA AGGGATCT7G ACACTAGAGC ATTACTOGAA 840 

CCTGCAGCGC GGAACTTTTT AGTCACGCCT CATTTAACGG TTGCTTOGAA CTGGAAGCCA 900 

AAACGAACGG AAGTTTCTTC GCTTSTCAAS TGGCGTGAGG TTGAAGACGT AGTTCGCGAT 960 

GAGTATGCAC ACAATTTTC6 CTTTACAATG AAAACACTTT CTACCACGTT TATAAGTCAA 1020 

ACAAACGAGT TTAATCTTAA CCAAATCCAT CTCAGTCAAT GTGTAAAGGA GGAAGCCCGG 1080 

25 GCTATTATTA ACCGGATCTA TACAACCAGA TACAACTCAT CTCATGTTAG AACCGGGGAT 1140 

ATCCAGACCT ACCTTGCCAG AQOGGOSTTT Gl i mUi l G I TTCAACCCCT GC7GAGCAAT 1200 

TCCCTCGCCC GTCTCTATCT CCAAGAATTC GTCCGTOAAA ACACTAA7CA TTCACCACAA 1260 

AAACACCCGA CTCGAAATAC CAGATCCCGA CGAAGCGTQC CAGTTGAGTT GCGTQCCAAT 1320 

AGAACAATAA CAACCACCTC ATCGGTQGAA TTTQCTATQC TCCAGTTTAC ATATGACCAC 1380 

30 ATTCAAGAGC ATSTTAATGA AATGT7GGCA CGTATTPCCT CGTCGTOGTG CCAGCTACAA 1440 

AAT CQOGAA C GCGCCCTTTG GAGCGGACTA TTTCCAATTA ACCCAttTGC TTTAGGGAGC 1500 

ACCATTTTQG ATCAACGTGT TAAAGC7CGT ATTUTCQQCG ACGTTATCTC CGTTTCTAAT 1560 

TGTCCAGAAC TGGGATC AGA TACACGCATT ATACTTCAAA ACTCTATGAG GGTATCTGGT 1620 

ASTACTACGC GTTCTTATAG CCGTCCT7TA ATTTCAATAG TTAGTTTAAA TGBGTCCGGG 1680 

35 ACGGTGGAGG GCCAGCTTQG AACAGATAAC GAGTTAATTA TGTCCAGAGA TCTCTTAGAA 1740 

CCATGGGTGG CTAATCACAA GCGATA7TTT CTATTTGGGC ATCACTACGT ATATTATGAG 1800 

GATTATCGTT ACGTCCGTGA AATCGCAGTC CATGATCTGG GAATCATTAG CAC1TACGTA 1860 

GATTTAAACT TAACACTTCT TAAAGATAGA GAGTTTATQC CGCTGCAAGT ATATACAAGA 1920 

GACGAGCTGC GGGATACAGG ATTACTAGAC TACAGTGAAA TTCAACGCCG AAATCAAATG 1980 

40 CATTCGCTGC GTTTTTATGA CATAGACAAG GTTGTGCAAT ATGATAGCGG AACGGCCATT 2040 

ATGCAGGGCA TOGCTCAGTT TTTCCAGGGA CTTGGGACCG GGGGCCAGGC CGTTOGACAT 2100 

GTGGTTCTTG GGGCCACGGG AGCGCTGCTT TCCACCGXAC ACGGATTTAC CACGTTTTTA 2160 

TCTAACCCAT TTCGGGCATT GGCCGTGGGA TTATTGGTTT TGGCGGGACT GGTAGCGGCC 2220 

TTTTTTGCGT ACCGGTACGT GCTTAAACTT AAAACAAGCC CGATGAAGGC ATTATATCCA 2280 

45 CTCACAACCA AGGGGTTAAA ACAGTTACCG GAAGGAATGG ATCCCTTTGC CGAGAAACCC 2340 

AACGCTACTG ATACCCCAAT AGAAGAAATT GGCGACTCAC AAAACACTGA ACCGTCGGTA 2400 

AATAGCGGGT TTGATCCCGA TAAATTTCGA GAAGCCCAGG AAATGATTAA ATATATGACG 2460 

TTAGTATCTG CGGCTGAGCG CCAAGAATCT AAAGCCCGCA AAAAAAATAA GACTAGCGCC 2520 

CTTTTAACTT CACGTCTTAC CGGCCTTGCT TTACGAAATC GCCGAGGATA CTCCCGTCTT 2580 

50 CGCACCGAGA ATGTAACGGG GGTGTAAA v 2608 

SEQ ID NO: 13: 

ATGCGCCAGG GCGCCGCGCG GGQGTCCCGG UflllUaK K TATGGGCGCT U l b UU m i G 60 

AGGCTGGGGG TCCTGGTGGC GTCGGCGGCT CCGAGTTCCC CCGGWCGCC TGGGGTCGCG 120 

55 GCCGCGACCC AGGCGGCGAA CGGSGGACCT GCCACTCCGG CGCCGCCCGC CCCTBQCCCC 180 

QCCCCAACQG GGGACACGAA ACCGAAGAAG AACAAAAAAC CGAAAAACCC ACCGCCGCCG 240 

CGCCCCGCCG GGGACAAOGC GACCGTCGCC GCGGGCCACG CCACCCTOCG CGAGCACCTG 300 

CGGGACATCA AGGCGGAGAA CACCGATGCA AACTTTTACG TGTGCCCACC CCCCACGGGC 360 

GCCACGGTGG 1GCAGTT0GA GCAGCCGCGC CGCTBCCCGA CCCGGCCCGA GGGTCAGAAC 420 

60 TACACGGAGG GCATGGOGGT GGTCTTCAAG GAGAACATCG CCCOGTACAA GTTCAAGGCC 480 

ACCA7GTACT ACAAAGACGT CACCGTTTCG CAGGTOQGT TCGGCCACCG CTACTCCCAG 540 

TTTATGGGGA TCTTTGAGGA CCGCGCCCCC GIUXtllLU AGGAGGTCAT CGACAAGATC 600 

AACGCCAAGG UU GtUU I Oi GTCCACGGCC AAGTAOGTGC GCAACAACCT GGAGACCACC 660 

9V 
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QCGTTTCACC GGGACGACCA CGAGACCGAC 
CGCACGAGCC GGGGCTGGCA CACCACCGAC 
TTCCACCGGT ACGGGACGAC GGTAAACTGC 
TACCCGTACG ACGAGTTTGT GCTGGCGACT 
5 GGCTACCGGG AQGGGTCGCA CACCGAACAC 
GTTGACGGCT TCTACGCGCG CGACCTCACC 
CGGAACC1GC TCACGACCCC CAAGTTCAGC 
TCQGTOGCA CCATGACCAA GTOGCAGGAG 
QGCTCCTTCC GATTCTCCTC CGACGCCATA 

10 TACCGGCTCT CGCGGGTGGA CCTGQGGGAC 
GACCGCATCT TCGCCCGCAG GTACAACGCG 
TACCTGGCCA ATGGQQQCTT TCTGATCGCG 
GAGCTCTACG TGCGGGAACA CCTCCGAGAG 
CCGCCGCCCG QQQCCAQCQC CAACGCGTCC 

15 GAGTTCGCCC GGCTGCAGTT TACGTACAAC 
GGCCGGGTTG CCATCGCGTG G1GGGAGCTG 
GCCCGCAAGC TGAACCCCAA CQCCATCQCC 
CGGATGCTCG GCGACGTCAT QQCCGTCTCC 
ATCGTCCAAA ACTCGATGCG CATCAGCTCG 

20 GTCAQCTTTC GGTAOGAAGA CCAGGQCCCG 
GAGCTQCGQC TGACGCGCGA TGCGATCGAG 
ACCTTCGGTG GGGGCTACGT GTACTTCGAG 
GCCGACATCA CCACCGTCAG CACCTTCATC 
GAGTTTGTCC CCCTOGAGGT 6TACACC0GC 

25 TACACGGAGG TCCAGCQCCG CAACCAGCTG 
GTCATCCACG CCGACQCCAA CGCCGCCATG 
ATQGQCGACC TGGGGGGCGC QGTC8QCAAG 
TCQQCCGTGT CGGGCGTGTC CTCCTTCATG 
C7GTTQGTCC TGGCCGGCCT QQCQQCQQCT 

30 CAGAGCAACC CCATGAAGGC CCTGTACCCG 
AACCCGGACG CGTCCGGGGA GGGCGAGGAG 
GAGQCCCGQG AGATCATACG GTACATQQCC 
AAQQCCAAGA AGAAQQQCAC GAQCQCQCTG 
CGCAAGCGCC QCAACACCAA CTACACCCAA 

35 GACGACCTGT GAC 



SEQ ID N0:14: 





Met 


Val 


Pro 


Asn 


Lys 


His 


Leu 




Ala 


Cys 


Gly 


Gin 


Thr 


Thr 


Pro 


40 


Gin 


Ala 


He 


Tyr 


Gin 


Glu 


Tyr 




Thr 


Thr 


Gly 


Glu 


Leu 


Phe 


Arg 




Thr 


Glu 


Asp 


Lys 


Val 


His 


Lys 




Asn 


He 


Val 


Pro 


Tyr 


He 


Phe 




Thr 


Ser 


Val 


Arg 


lie 


Phe 


Asn 


45 


Thr 


Asn 


Lys 


Trp 


Glu 


Leu 


Ser 




He 


Net 


Asp 


Lys 


Thr 


Tyr 


Gin 




Asn 


Gly 


Met 


Leu 


Asn 


Ser 


Tyr 




Val 


Asp 


Leu 


Lys 


Pro 


Val 


Asp 




lie 


Ser 


Gin 


Pro 


Lys 


Val 


Phe 


50 


Tyr 


Arg 


Thr 


Arg 


Thr 


Thr 


Val 




Arg 


Ser 


Ala 


Asp 


Pro 


Tyr 


Thr 




Val 


Glu 


Val 


Ser 


Pro 


Phe 


Cys 




Thr 


Asp 


Val 


Leu 


Ser 


Val 


Gin 




Tyr 


Gly 


Asn 


Arg 


Ala 


Thr 


Ser 


55 


His 


Thr 


Leu 


Asp 


Tyr 


Ser 


Val 




Ser 


Val 


Cys 


Ser 


Met 


Val 


Phe 




Thr 


Glu 


His 


Asp 


Leu 


Thr 


Tyr 




Gly 


Phe 


Ser 


Thr 


Val 


Lys 


Glu 




Asn 


Cys 


Leu 


Met 


Thr 


His 


He 


60 


Arg 


Val 


Asn 


Asn 


Thr 


His 


Thr 




Thr 


Glu 


61y 


Gly 


Met 


He 


Leu 




Leu 


Glu 


Glu 


Ala 


Met 


Leu 


Glu 




Ala 


Pro 


Thr 


Ser 


Ser 


Ser 


Arg 



A7GGAGCTCA AACCGGCCAA CGCCGCGACC 720 

CTCAAGTACA ACCCCTOCG GGTGGAGGCG 780 

ATCGTCGAGG AGGTGGACGC GCGOCGGTG 840 

GGCGACTTTG TGTACATGTC CCCGTTTTAC 900 

ACCAGCTACG CCGCCGACCG CTTCAAGCAG 960 

ACCAAGGCCC GGGCCACGGC GCCGACCACC 1020 

GTGQCCTQQG AC7GGGTGCC AAAGCGCCCG 1080 

GTGSACGAGA TGCTGCGCTC CGAGTACGGC 1140 

TCCACCACCT 7CACCACCAA CCTGACCGAG 1200 

TGCATCGGCA AGGACGCCCG CGAC8CCATG 1260 

ACGCACATCA AGGTGGGCCA GCCGCAGTAC 1320 

TACCAGCCCC TTCTCAGCAA CACGCTCGCG 1380 

CAGAGCCGCA AGCCCCCAAA CCCCACGCCC 1440 

GTGGAGCGCA TCAAGACCAC CTCCTCCATC 1500 

CACATACAGC GCCATGTCAA CGATATGTTG 1560 

CAGAATCACG AGCTCACCCT GTGGAACGAG 1620 

TCGGCCACCG TOflCCGfiCG GGTGAGCGCG 1680 

ACGTGCGTGC CGGTUGCCGC GGACAACGTG 1740 

CGQCCCQGBG CCTQCTACAG CCGCCCCCTG 1800 

TTGG1CGAGG GGCAGGTGGG GGAGAACAAC 1860 

CCGTOCACCG TGGGACACCG GCGCTACTTC 1920 

GAGTACGCGT ACTCCCACCA GCTGAGCCGC 1980 

GACCTCAACA TCACCATGCT GGAGGATCAC 2040 

CACGAGATCA AGGACAGCGG CCTGCTGGAC 2100 

CACGACCTGC GCTTCGCCGA CATCGACACG 2160 

TTCGCGGGCC TOGGCGCGTT CTTCGAGGGG 2220 

GTCGTGATGG GCATCGTOGG CGGCGTQGTA 2280 

TCCAACCCCT TTGGGGCGCT GGCCGTGGGT 2340 

I l l limXI TTCGCTACGT CATGCGGC7G 2400 

CTAACCACCA AGGAGCTCAA GAACCCCACC 2460 

GGCGGCGACT TTGACGAGGC CAAGCTAGCC 2520 

CTOGT5TCTG CCATGGAGCG CACGGAACAC 2580 

CTCAGCGCCA AGGTCACCGA CATGGTCATG 2640 

GTTCCCAACA AAGACGGTGA CGCCGACGAG 2700 

2713 



Leu Leu He He Leu Ser Phe Ser Thr 
Thr Thr Ala Val Glu Lys Asn Lys Thr 
Phe Lys Tyr Arg Val Cys Ser Ala Ser 
Phe Asp Leu Asp Arg Thr Cys Pro Ser 
Glu Gly He Leu Leu Val Tyr Lys Lys 
Lys Val Arg Arg Tyr Lys Lys He Thr 
Gly Trp Thr Arg Glu Gly Val Ala He 
Arg Ala Val Pro Lys Tyr Glu He Asp 
Cys His Asn Cys Met Gin He Glu Val 
Tyr Asp Arg Asp Gly Asn Asn Lys Thr 
Gly Leu Thr Gly Ala He Thr Arg Tyr 
Ala Asp Pro Gly Trp Leu Trp Gly Thr 
Asn Cys Glu He Val Asp Met Phe Ala 
Tyr Phe Val Thr Ala Leu Gly Asp Thr 
Asp Val Asp Asn Ser Cys Pro Asn Ala 
He Asp Leu Asn His Thr Val Val Asp 
Gin Gin His Lys Lys Arg He Phe Ala 
Ser Trp Glu Ala Val Asn Lys Ser Ala 
Trp Lys Ser Phe Gin Arg Ala He Gin 
His Phe He Ala Asn Glu He Thr Ala 
Pro Leu Ala Asn Phe Thr Ser Asp Tyr 
Asn Thr Thr Leu Glu Asp Lys He Ala 
Pro Asn Gly Thr Ala Glu Tyr Tyr Gin 
Val Trp Gin Pro Leu He Ala He Glu 
Ala Thr Thr Ser Pro Val Thr Pro Ser 
Ser Lys Arg Ala He Arg Ser He Arg 

is 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Asp Val 
Tyr Ala 
Leu Ala 
Tyr Glu 
Gly Lys 
Glu Cys 
Lys Thr 
Lys Phe 
Asn Glu 
Ala Glu 
Tyr Val 
Thr Phe 
Ala Val 
Asp Leu 
Ser Gly 
Arg lie 
Gly Lys 
Gly He 
Met Phe 
Leu Val 
Tyr Pro 
Glu Thr 
Ala Tyr 
Arg Gin 
He Ser 



Ser Ala 
Tyr Asp 
He Thr 
He Ala 
Pro Val 
lie Asn 
Glu Asn 
Val Asn 
He Leu 
Thr Phe 
His Val 
Leu Ala 
Glu Leu 
Glu Thr 
Leu Arg 
He Gin 
Val He 
Val Thr 
Thr Phe 
Arg Arg 
Asp Val 
He Lys 
Lys Lys 
Ala Ala 
Thr Glu 



Gly Ser 
Lys Leu 
Trp Cys 
Lys He 
Ser Arg 
Val Asp 
Asn Asp 
Ser Ser 
Leu Ser 
Phe Thr 
Glu Thr 
Leu Asn 
Tyr Ser 
Met Phe 
Lys Asp 
Asp Phe 
Val Asn 
Gly He 
Leu Leu 
Thr Asn 
Glu Lys 
Gin He 
Lys Glu 
Glu Thr 
Asp Lys 



Glu Asn 
Arg Gin 
Arg Glu 
Asn Pro 
Lys Ala 
Gin Ser 
He Cys 
Gin Leu 
Glu Ser 
Ala Lys 
Leu Pro 
Leu Thr 
Ser Gly 
Arg Glu 
Phe Asp 
Ser Glu 
Val Ala 
Leu Asn 
He Gly 
Asn Met 
Ser Lys 
Leu Leu 
Glu Gin 
Phe Leu 
He Val 



Asn Val 
Ser He 
Gin Val 
Thr Ser 
Leu Gly 
Ser Val 
Tyr Ser 
Phe Lys 
Leu Val 
Asn Glu 
Val Asn 
Phe He 
Glu Arg 
Tyr Asn 
Asn Ser 
He Leu 
Ser Gly 
Phe He 
Ala Val 
Ser Gin 
Ser Thr 
Gly Met 
Arg Ala 
Arg Lys 



Phe Leu Ser 
Asn Asn Val 
Arg Gin Thr 
Val Met Thr 
Asp Val He 
Ser He His 
Arg Pro Pro 
Gly Gin Leu 
Glu Asn Cys 
Thr Tyr His 
Asn lie Ser 
Glu Asn He 
Lys Leu Ala 
Tyr Tyr Ala 
Gin Arg Asn 
Ala Asp Leu 
Ala Phe Ser 
Lys Asn Pro 
He He Leu 
Ala Pro He 
Val Thr Pro 
His Asn Met 
Ala Arg Pro 
Arg Ser Gly 



Gin He Gin 
Leu Glu Glu 
Met Val Trp 
Ala He Tyr 
Ser Val Thr 
Lys Ser Leu 
Val Thr Phe 
Gly Ala Arg 
His Gin Asn 
Phe Lys Asn 
Thr Leu Asp 
Asp Phe Lys 
Asn Val Phe 
Gin Ser He 
Asn Arg Asp 
Gly Ser He 
Leu Phe Gly 
Leu Gly Gly 
Val He Leu 
Arg Met He 
Met Glu Pro 
Gin Gin Glu 
Ser He Phe 
Tyr Lys Gin 



SEQ ID NO: 15 
Met Tyr 
Phe Asn 
Ser Thr 
Thr Ser 
Ser Lys 
Thr Val 
Cys Ser 
Gin Thr 
Leu Val 
Tyr Arg 
Ala Ala 
Glu He 
Thr Val 
Asn Lys 
Asn Arg 
Pro Gly 
Met Tyr 
Gly Asp 
Thr Thr 
Glu Asn 
Asp Lys 
Ala Glu 
Phe Pro 
Ala Asn 
Glu Leu 
Asn Glu 
Arg Asp 
He Trp 
Thr He 
Ser Val 
Gin Val 
Leu Glu 
Leu Met 
Ala He 
Met Val 
Lys Ser 



Tyr Lys 
Gin Thr 
Thr Ser 
Thr Asn 
Pro Ser 
Thr Asp 
Ala Ser 
Cys Pro 
Leu Lys 
Lys He 
Val Thr 
Ser Met 
He Asn 
Ser Val 
Tyr Phe 
He Tyr 
Ala Arg 
Thr He 
Gly Asn 
His Gin 
Arg He 
Asp Arg 
Arg Ala 
Glu Val 
Arg Gly 
Thr Leu 
Gly Glu 
Gin Ala 
Glu Arg 
Asp Thr 
Gin Tyr 
Glu Leu 
Trp Tyr 
Tyr Gly 
Ser Glu 
Met Arg 



Thr lie Leu 
Thr Thr His 
Ser Thr Thr 
Ser Ser Leu 
Thr Asp Asn 
Asp Thr Ala 
Ser Ser Ser 
Asp Thr Lys 
Lys Asn He 
Ala Thr Ser 
Asn Arg Asp 
He Asp Arg 
Gly He Leu 
Pro Leu Gin 
Ser Gin Pro 
Arg Val Arg 
Ser Val Glu 
Glu He Ser 
Ser Thr Ser 
Thr Val Asp 
Phe Leu Lys 
Glu Arg Ala 
He Gin Thr 
Thr Ala Ser 
Asn Thr Glu 
Glu Glu Thr 
Val Lys Tyr 
Met Lys Pro 
Asn Asn Lys 
Lys Thr Phe 
Ala Tyr Asp 
Thr Lys Thr 
Glu Leu Ser 
Lys Pro Val 
Cys He Asn 
Thr Asp Asp 



Phe Phe 
Ser Thr 
Thr Ser 
Ala Ala 
Gin Gly 
Ser Lys 
Gly Glu 
Asp Lys 
Val Pro 
Val Thr 
Asp He 
Thr Tyr 
Asn Thr 
Pro Val 
Leu He 
Thr Thr 
Pro Tyr 
Pro Phe 
Arg Asp 
Tyr Glu 
Asp Glu 
He Cys 
He His 
Phe Leu 
He Leu 
Val Lys 
Tyr Lys 
Leu Asn 
Thr Gly 
Gin Gly 
His Leu 
Trp Cys 
Lys He 
Ala Val 
Val Asp 
Pro Lys 



Ala Leu 
Thr Thr 
Thr Ser 
Ser Pro 
Thr Ser 
Asn Phe 
Leu Phe 
Lys His 
Tyr He 
Val Tyr 
Ser Arg 
His Cys 
Tyr He 
Ala Gly 
Tyr Ala 
Val Asn 
Thr His 
Cys His 
Ala Thr 
Arg Arg 
Glu Tyr 
Asp Phe 
Asn Glu 
Thr Ser 
Asn Cys 
Lys Phe 
Thr Asn 
Leu Ser 
Asn Lys 
Ala Lys 
Arg Thr 
Arg Glu 
Asn Pro 
Lys Ala 
Gin Ala 
Val Cys 



He Lys Val 
Ser Pro Ser 
Lys Pro Ser 
Gin Asn Thr 
Thr Pro Thr 
Tyr Lys Tyr 
Arg Phe Asp 
Val Glu Gly 
Phe Lys Val 
Arg Gly Trp 
Ala He Pro 
Phe Ser Ala 
Asp Arg Asp 
Leu Thr Glu 
Glu Pro Gly 
Cys Glu Val 
Phe He Thr 
Asn Asn Ser 
Lys Val Trp 
Gly His Pro 
Thr He Ser 
Val He Trp 
Ser Phe His 
Asn Gin Glu 
Met Asn Ser 
Asn Lys Ser 
Gly Gly Leu 
Glu His Thr 
Ser Arg Gin 
61 y Leu Ser 
Ser Met Asn 
Gin Lys Lys 
Val Ser Val 
Met Gly Asp 
Ser Val Asn 
Tyr Ser Arg 



Cys Ser 
He Ser 
Asn Thr 
Ser Thr 
He Pro 
Arg Val 
Leu Asp 
He Leu 
Arg Lys 
Ser Gin 
Tyr Asn 
Met Ala 
Ser Glu 
Asn He 
Trp Phe 
Val Asp 
Ala Leu 
Gin Cys 
He Glu 
Thr Lys 
Trp Lys 
Lys Thr 
Phe Val 
Glu Thr 
Thr He 
His He 
Phe Leu 
Asn Tyr 
Lys Arg 
Thr Ala 
His He 
Asp Asn 
Met Ala 
Ala Phe 
He His 
Pro Leu 
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Val Thr Phe Lys Phe Val Asn Ser Thr Ala Thr Phe Arg Gly Gin Leu 
Gly Thr Arg Asn Glu He Leu Leu Thr Asn Thr His Val Glu Thr Cys 
Arg Pro Thr Ala Asp His Tyr Phe Phe Val Lys Asn Het Thr His Tyr 
Phe Lys Asp Tyr Lys Phe Val Lys Thr Met Asp Thr Asn Asn He Ser 
5 Thr Leu Asp Thr Phe Leu Thr Leu Asn Leu Thr Phe He Asp Asn He 

Asp Phe Lys Thr Val Glu Leu Tyr Ser Glu Thr Glu Arg Lys Met Ala 
Ser Ala Leu Asp Leu Glu Thr Met Phe Arg Glu Tyr Asn Tyr Tyr Thr 
Gin Lys Leu Ala Ser Leu Arg Glu Asp Leu Asp Asn Thr He Asp Leu 
Asn Arg Asp Arg Leu Val Lys Asp Leu Ser Glu Met Met Ala Asp Leu 

10 Gly Asp lie Gly Lys Val Val Val Asn Thr Phe Ser Gly He Val Thr 

Val Phe Gly Ser He Val Gly Gly Phe Val Ser Phe Phe Thr Asn Pro 
He Gly Gly Val Thr He He Leu Leu Leu He Val Val Val Phe Val 
Val Phe He Val Ser Arg Arg Thr Asn Asn Met Asn Glu Ala Pro He 
Lys Met He Tyr Pro Asn He Asp Lys Ala Ser Glu Gin Glu Asn He 

15 Gin Pro Leu Pro Gly Glu Glu He Lys Arg He Leu Leu Gly Met His 

Gin Leu Gin Gin Ser Glu His Gly Lys Ser Glu Glu Glu Ala Ser His 
Lys Pro Gly Leu Phe Gin Leu Leu Gly Asp Gly Leu Gin Leu Leu Arg 
Arg Arg Gly Tyr Thr Arg Leu Pro Thr Phe Asp Pro Ser Pro Gly Asn 
Asp Thr Ser Glu Thr His 61n Lys Tyr Val 

20 

SEQ ID NO: 16: 

Met Gly Val Gly Gly Gly Pro Arg Val Val Leu Cys Leu Trp Cys Val 
Ala Ala Leu Leu Cys Gin Gly Val Ala Gin Glu Val Val Ala Glu Thr 
Thr Thr Pro Phe Ala Thr His Arg Pro Glu Val Val Ala Glu Glu Asn 

25 Pro Ala Asn Pro Phe Leu Pro Phe Arg Val Cys Gly Ala Ser Pro Thr 

Gly Gly Glu He Phe Arg Phe Pro Leu Glu Glu Ser Cys Pro Asn Thr 
Glu Asp Lys Asp His He Glu Gly lie Ala Leu He Tyr Lys Thr Asn 
He Val Pro Tyr Val Phe Asn Val Arg Lys Tyr Arg Lys He Met Thr 
Ser Thr Thr He Tyr Lys Gly Trp Ser Glu Asp Ala He Thr Asn Gin 

30 His Thr Arg Ser Tyr Ala Val Pro Leu Tyr Glu Val Gin Met Met Asp 

His Tyr Tyr Gin Cys Phe Ser Ala Val Gin Val Asn Glu Gly Gly His 
Val Asn Thr Tyr Tyr Asp Arg Asp Gly Trp Asn Glu Thr Ala Phe Leu 
Lys Pro Ala Asp Gly Leu Thr Ser Ser He Thr Arg Tyr Gin Ser Gin 
Pro Glu Val Tyr Ala Thr Pro Arg Asn Leu Leu Trp Ser Tyr Thr Thr 

35 Arg Thr Thr Val Asn Cys Glu Val Thr Glu Met Ser Ala Arg Ser Met 

Lys Pro Phe Glu Phe Phe Val Thr Ser Val Gly Asp Thr He Glu Met 
Ser Pro Phe Leu Lys Glu Asn Gly Thr Glu Pro Glu Lys He Leu Lys 
Arg Pro His Ser He Gin Leu Leu Lys Asn Tyr Ala Val Thr Lys Tyr 
Gly Val Gly Leu Gly Gin Ala Asp Asn Ala Thr Arg Phe Phe Ala He 

40 Phe Gly Asp Tyr Ser Leu Ser Trp Lys Ala Thr Thr Glu Asn Ser Ser 

Tyr Cys Asp Leu He Leu Trp Lys Gly Phe Ser Asn Ala He Gin Thr 
Gin His Asn Ser Ser Leu His Phe He Ala Asn Asp lie Thr Ala Ser 
Phe Ser Thr Pro Leu Glu Glu Glu Ala Asn Phe Asn Glu Thr Phe Lys 
Cys He Trp Asn Asn Thr Gin Glu Glu He Gin Lys Lys Leu Lys Glu 

45 Val Glu Lys Thr His Arg Pro Asn Gly Thr Ala Lys Val Tyr Lys Thr 

Thr Gly Asn Leu Tyr He Val Trp Gin Pro Leu He Gin He Asp Leu 
Leu Asp Thr His Ala Lys Leu Tyr Asn Leu Thr Asn Ala Thr Ala Ser 
ProTVSerThrProThrThrSerProAf^ArgArgArgArgAspThr 
Ser Ser Val Ser Gly Gly Gly Asn Asn Gly Asp Asn Ser Thr Lys Glu 

50 Glu Ser Val Ala Ala Ser Gin Val Gin Phe Ala Tyr Asp Asn Leu Arg 

Lys Ser He Asn Arg Val Leu Gly Glu Leu Ser Arg Ala Trp Cys Arg 
Glu Gin Tyr Arg Ala Ser Leu Met Trp Tyr Glu Leu Ser Lys He Asn 
Pro Thr Ser Val Met Ser Ala He Tyr Gly Arg Pro Val Ser Ala Lys 
Leu He Gly Asp Val Val Ser Val Ser Asp Cys He Ser Val Asp Gin 

55 Lys Ser Val Phe Val His Lys Asn Met Lys Val Pro Gly Lys Glu Asp 

Leu Cys Tyr Thr Arg Pro Val Val Gly Phe Lys Phe He Asn Gly Ser 
Glu Leu Phe Ala Gly Gin Leu Gly Pro Arg Asn Glu He Val Leu Ser 
Thr Ser Gin Val Glu Val Cys Gin His Ser Cys Glu His Tyr Phe Gin 
Ala Gly Asn Gin Met Tyr lys Tyr Lys Asp Tyr Tyr Tyr Val Ser Thr 

60 Leu Asn Leu Thr Asp He Pro Thr Leu His Thr Met He Thr Leu Asn 

Leu Ser Leu Val Glu Asn He Asp Phe Lys Val lie Glu Leu Tyr Ser 
Lys Thr Glu Lys Arg Leu Ser Asn Val Phe Asp He Glu Thr Met Phe 
Arg Glu Tyr Asn Tyr Tyr Thr Gin Asn Leu Asn 61 y Leu Arg Lys Asp 
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Leu Asp Asp Ser He Asp His Gly Arg Asp Ser Phe He Gin Thr Leu 
Gly Asp He Net Gin Asp Leu Gly Thr lie Gly Lys Val Val Val Asn 
Val Ala Ser Gly Val Phe Ser Leu Phe Gly Ser lie Val Ser Gly Val 
He Ser Phe Phe Lys Asn Pro Phe Gly Gly Met Leu Leu He Val Leu 
5 He He Ala Gly Val Val Val Val Tyr Leu Phe Met Thr Arg Ser Arg 

Ser lie Tyr Ser Ala Pro He Arg Met Leu Tyr Pro Gly Val Glu Arg 
Ala Ala Gin Glu Pro Gly Ala His Pro Val Ser Glu Asp Gin He Arg 
Asn He Leu Met Gly Met His Gin Phe Gin Gin Arg Gin Arg Ala Glu 
Glu Glu Ala Arg Arg Glu Glu Glu Val Lys Gly Lys Arg Thr Leu Phe 
10 Glu Val He Arg Asp Ser Ala Thr Ser Val Leu Arg Arg Arg Arg Gly 

Gly Gly Gly Tyr Gin Arg Leu Gin Arg Asp Gly Ser Asp Asp Glu Gly 
Asp Tyr Glu Pro Leu Arg Arg Gin Asp Gly Gly Tyr Asp Asp Val Asp 
Val Glu Ala Gly Thr Ala Asp Thr Gly Val 

15 SEQ ID N0:17: 

Met Tyr Pro Thr Val Lys Ser Met Arg Val Ala His Leu Thr Asn Leu 
Leu Thr Leu Leu Cys Leu Leu Cys His Thr His Leu Tyr Val Cys Gin 
Pro Thr Thr Leu Arg Gin Pro Ser Asp Met Thr Pro Ala Gin Asp Ala 
Pro Thr Glu Thr Pro Pro Pro Leu Ser Thr Asn Thr Asn Arg Gly Phe 

20 Glu Tyr Phe Arg Val Cys Gly Val Ala Ala Thr Gly Glu Thr Phe Arg 

Phe Asp Leu Asp Lys Thr Cys Pro Ser Thr Gin Asp Lys Lys His Val 
Glu Gly He Leu Leu Val Tyr Lys He Asn He Val Pro Tyr lie Phe 
Lys He Arg Arg Tyr Arg Lys lie He Thr Gin Leu Thr He Trp Arg 
Gly Leu Thr Thr Ser Ser Val Thr Gly Lys Phe Glu Met Ala Thr Gin 

25 Ala His Glu Trp Glu Val Gly Asp Phe Asp Ser He Tyr Gin Cys Tyr 

Asn Ser Ala Thr Met Val Val Asn Asn Val Arg Gin Val Tyr Val Asp 
Arg Asp Gly Val Asn Lys Thr Val Asn He Arg Pro Val Asp Gly Leu 
Thr Gly Asn He Gin Arg Tyr Phe Ser Gin Pro Thr Leu Tyr Ser Glu 
Pro Gly Trp Met Pro Gly Phe Tyr Arg Val Arg Thr Thr Val Asn Cys 

30 Glu He Val Asp Met Val Ala Arg Ser Met Asp Pro Tyr Asn Tyr He 

Ala Thr Ala Leu Gly Asp Ser Leu Glu Leu Ser Pro Phe Gin Thr Phe 
Asp Asn Thr Ser Gin Cys Thr Ala Pro Lys Arg Ala Asp Met Arg Val 
Arg Glu Val Lys Asn Tyr Lys Phe Val Asp Tyr Asn Asn Arg Gly Thr 
Ala Pro Ala Gly Gin Ser Arg Thr Phe Leu Glu Thr Pro Ser Ala Thr 

35 Tyr Ser Trp Lys Thr Ala Thr Arg Gin Thr Ala Thr Cys Asp Leu Val 

His Trp Lys Thr Phe Pro Arg Ala He Gin Thr Ala His Glu His Ser 
Tyr His Phe Val Ala Asn Glu Val Thr Ala Thr Phe Asn Thr Pro Leu 
Thr Glu Val Glu Asn Phe Thr Ser Thr Tyr Ser Cys Val Ser Asp Gin 
He Asn Lys Thr lie Ser Glu Tyr He Gin Lys Leu Asn Asn Ser Tyr 

40 Val Ala Ser Gly Lys Thr Gin Tyr Phe Lys Thr Asp Gly Asn Leu Tyr 

Leu He Trp Gin Pro Leu Glu His Pro Glu lie Glu Asp He Asp Glu 
Asp Ser Asp Pro Glu Pro Thr Pro Ala Pro Pro Lys Ser Thr Arg Arg 
Lys Arg Glu Ala Ala Asp Asn Gly Asn Ser Thr Ser Glu Val Ser Lys 
Gly Ser Glu Asn Pro Leu He Thr Ala Gin He Gin Phe Ala Tyr Asp 

45 Lys Leu Thr Thr Ser Val Asn Asn Val Leu Glu Glu Leu Ser Arg Ala 

Trp Cys Arg Glu Gin Val Arg Asp Thr Leu Met Trp Tyr Glu Leu Ser 
Lys Val Asn Pro Thr Ser Val Met Ser Ala He Tyr Gly Lys Pro Val 
Ala Ala Arg Tyr Val Gly Asp Ala He Ser Val Thr Asp Cys He Tyr 
Val Asp Gin Ser Ser Val Asn He His Gin Ser Leu Arg Leu Gin His 

50 Asp Lys Thr Thr Cys Tyr Ser Arg Pro Arg Val Thr Phe Lys Phe He 

Asn Ser Thr Asp Pro Leu Thr Gly Gin Leu Gly Pro Arg Lys Glu He 
lie Leu Ser Asn Thr Asn He Glu Thr Cys Lys Asp Glu Ser Glu His 
Tyr Phe He Val Gly Glu Tyr He Tyr Tyr Tyr Lys Asn Tyr He Phe 
Glu Glu Lys Leu Asn Leu Ser Ser He Ala Thr Leu Asp Thr Phe He 

55 Ala Leu Asn He Ser Phe He Glu Asn He Asp Phe Lys Thr Val Glu 

Leu Tyr Ser Ser Thr Glu Arg Lys Leu Ala Ser Ser Val Phe Asp He 
Glu Ser Met Phe Arg Glu Tyr Asn Tyr Tyr Thr Tyr Ser Leu Ala Gly 
He Lys Lys Asp Leu Asp Asn Thr He Asp Tyr Asn Arg Asp Arg Leu 
Val Gin Asp Leu Ser Asp Met Met Ala Asp Leu Gly Asp He Gly Arg 

60 Ser Val Val Asn Val Val Ser Ser Val Val Thr Phe Phe Ser Ser He 

Val Thr Gly Phe He Lys Phe Phe Thr Asn Pro Leu Gly Gly He Phe 
He Leu Leu He He Gly Gly He He Phe Leu Val Val Val Leu Asn 
Arg Arg Asn Ser Gin Phe His Asp Ala Pro lie Lys Met Leu Tyr Pro 
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Ser Val Glu Asn Tyr Ala Ala Arg Gin Ala Pro Pro Pro Tyr Ser Ala 
Ser Pro Pro Ala He Asp Lys Glu Glu He Lys Arg He Leu Leu Gly 
Net His Gin Val His Gin Glu Glu Lys Glu Ala Gin Lys Gin Leu Thr 
Asn Ser Gly Pro Thr Leu Trp Gin Lys Ala Thr Gly Phe Leu Arg Asn 
5 Arg Arg Lys Gly Tyr Ser Gin Leu Pro Leu Glu Asp Glu Ser Thr Ser 

Leu 

SEQ 10 HO: 18: 

Met Thr Arg Arg Arg Val Leu Ser Val Val Val Leu Leu Ala Ala Leu 

10 Ala Cys Arg Leu Gly Ala Gin Thr Pro Glu Gin Pro Ala Pro Pro Ala 

Thr Thr Val Gin Pro Thr Ala Thr Arg Gin Gin Thr Ser Phe Pro Phe 
Arg Val Cys Glu Leu Ser Ser His Gly Asp Leu Phe Arg Phe Ser Ser 
Asp He Gin Cys Pro Ser Phe Gly Thr Arg Glu Asn His Thr Glu Gly 
Leu Leu Met Val Phe Lys Asp Asn He He Pro Tyr Ser Phe Lys Val 

15 Arg Ser Tyr Thr Lys He Val Thr Asn He Leu He Tyr Asn Gly Trp 

Tyr Ala Asp Ser Val Thr Asn Arg His Glu Glu Lys Phe Ser Val Asp 
Ser Tyr Glu Thr Asp Gin Met Asp Thr He Tyr Gin Cys Tyr Asn Ala 
Val Lys Met Thr Lys Asp Gly Leu Thr Arg Val Tyr Val Asp Arg Asp 
Gly Val Asn He Thr Val Asn Leu Lys Pro Thr Gly Gly Leu Ala Asn 

20 Gly Val Arg Arg Tyr Ala Ser Gin Thr Glu Leu Tyr Asp Ala Pro Gly 

Trp Leu lie Trp Thr Tyr Arg Thr Arg Thr Thr Val Asn Cys Leu He 
Thr Asp Met Met Ala Lys Ser Asn Ser Pro Phe Asp Phe Phe Val Thr 
Thr Thr Gly Gin Thr Val Glu Met Ser Pro Phe Tyr Asp Gly Lys Asn 
Lys Glu Thr Phe His Glu Arg Ala Asp Ser Phe His Val Arg Thr Asn 

25 Tyr Lys He Val Asp Tyr Asp Asn Arg Gly Thr Asn Pro Gin Gly Glu 

Arg Arg Ala Phe Leu Asp Lys Gly Thr Tyr Thr Leu Ser Trp Lys Leu 
Glu Asn Arg Thr Ala Tyr Cys Pro Leu Gin His Trp Gin Thr Phe Asp 
Ser Thr He Ala Thr Glu Thr Gly lys Ser lie His Phe Val Thr Asp 
Glu Gly Thr Ser Ser Phe Val Thr Asn Thr Thr Val Gly He Glu Leu 

30 Pro Asp Ala Phe Lys Cys He Glu Glu Gin Val Asn Lys Thr Met His 

Glu Lys Tyr Glu Ala Val Gin Asp Arg Tyr Thr Lys Gly Gin Glu Ala 
He Thr Tyr Phe He Thr Ser Gly Gly Leu Leu Leu Ala Trp Leu Pro 
Leu Thr Pro Arg Ser Leu Ala Thr Val Lys Asn Leu Thr Glu Leu Thr 
Thr Pro Thr Ser Ser Pro Pro Ser Ser Pro Ser Pro Pro Ala Pro Ser 

35 Ala Ala Arg Gly Ser Thr Pro Ala Ala Val Leu Arg Arg Arg Arg Arg 

Asp Ala Gly Asn Ala Thr Thr Pro Val Pro Pro Thr Ala Pro Gly Lys 
Ser Leu Gly Thr Leu Asn Asn Pro Ala Thr Val Gin He Gin Phe Ala 
Tyr Asp Ser Leu Arg Arg Gin He Asn Arg Met Leu Gly Asp Leu Ala 
Arg Ala Trp Cys Leu Glu Gin lys Arg Gin Asn Met Val Leu Arg Glu 

40 Leu Thr Lys He Asn Pro Thr Thr Val Met Ser Ser He Tyr Gly Lys 

Ala Val Ala Ala Lys Arg Leu Gly Asp Val He Ser Val Ser Gin Cys 
Val Pro Val Asn Gin Ala Thr Val Thr Leu Arg Lys Ser Met Arg Val 
Pro Gly Ser Glu Thr Met Cys Tyr Ser Arg Pro Leu Val Ser Phe Ser 
Phe He Asn Asp Thr Lys Thr Tyr Glu Gly Gin Leu Gly Thr Asp Asn 

45 Glu He Phe Leu Thr Lys Lys Met Thr Glu Val Cys Gin Ala Thr Ser 

Gin Tyr Tyr Phe Gin Ser Gly Asn Glu He His Val Tyr Asn Asp Tyr 
His His Phe Lys Thr He Glu Leu Asp Gly He Ala Thr Leu Gin Thr 
Phe He Ser Leu Asn Thr Ser Leu He Glu Asn He Asp Phe Ala Ser 
Leu Glu Leu Tyr Ser Arg Asp Glu Gin Arg Ala Ser Asn Val Phe Asp 

50 Leu Glu Gly He Phe Arg Glu Tyr Asn Phe Gin Ala Gin Asn He Ala 

Gly Leu Arg Lys Asp Leu Asp Asn Ala Val Ser Asn Gly Arg Asn Gin 
Phe Val Asp Gly Leu Gly Glu Leu Met Asp Ser Leu Gly Ser Val Gly 
Gin Ser He Thr Asn Leu Val Ser Thr Val Gly Gly Leu Phe Ser Ser 
Leu Val Ser Gly Phe He Ser Phe Phe lys Asn Pro Phe Gly Gly Met 

55 Leu He Leu Val Leu Val Ala Gly Val Val He Leu Val He Ser Leu 

Thr Arg Arg Thr Arg Gin Met Ser Gin Gin Pro Val Gin Met Leu Tyr 
Pro Gly He Asp Glu Leu Ala Gin Gin His Ala Ser Gly Glu Gly Pro 
Gly He Asn Pro He Ser Lys Thr Glu Leu Gin Ala He Met Leu Ala 
Leu His Glu Gin Asn Gin Glu Gin lys Arg Ala Ala Gin Arg Ala Ala 

60 Gly Pro Ser Val Ala Ser Arg Ala Leu Gin Ala Ala Arg Asp Arg Phe 

Pro Gly Leu Arg Arg Arg Arg Tyr His Asp Pro Glu Thr Ala Ala Ala 
Leu Leu Gly Glu Ala Glu Thr Glu Phe 
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SEQ ID NO: 19 
Met Glu 
Val Cys 
Ala Thr 
5 Arg Ser 

His Gly 
Val Val 
Ala Gin 
Ser Met 
10 Tyr Lys 

Lys Val 
Leu Leu 
His His 
He Ala 
15 Lys Thr 

Arg Tyr 
Leu Tyr 
Ala Arg 
Val Val 
20 Tyr Phe 

He Val 
Leu Val 
Gin Asp 
Glu Arg 
25 Ala Lys 

Ser Asp 
Gin Gin 
Asn Val 
Gly lie 
30 Ser Ser 

Asn Asn 
Tyr Ala 
Arg Ala 
Thr Leu 
35 Leu Ser 

Val Leu 
Val Leu 
Arg Pro 
Gly Gin 
40 Glu Glu 

Ala Tyr 
Ser lie 
Glu Asn 
Arg Ser 
45 Ser Tyr 

Leu Pro 
Ala Ala 
Val Ala 
Gly Ala 
50 Tyr Leu 

Asn Leu 
Gly Asn 
Ser Val 
Ala Ser 
55 Leu Leu 

Gly Thr 
Lys Pro 
His Leu 



Ser Arg He 
Leu Gly Ala 
His Ser His 
Gly Ser Val 
Val Asn Glu 
Gly Val Asn 
Gly Thr Asp 
Lys Pro He 
Arg Asn lie 
Leu Thr Phe 
Gly Ser Asn 
He Asn Ser 
Gly Thr Val 
Met Gin Leu 
Val Tlir Val 
Arg Glu Thr 
Ser Lys Tyr 
Asp He Ser 
Gly Glu Asn 
Ser Asp Phe 
Ala Phe Leu 
Glu Lys Asn 
Thr He Arg 
Met Thr Ala 
Ser Ala Leu 
He Phe Asn 
Ser Val Phe 
Lys Gin Lys 
Leu Asn Leu 
Ala Thr His 
Gin Leu Gin 
Leu Ala Gin 
Glu Val Phe 
Ala He Tyr 
Gly Leu Ala 
Arg Asp Met 
Val Val He 
Leu Gly Glu 
Cys Gin Leu 
Glu Tyr Val 
Ser Thr Val 
Thr Asp Phe 
Ser Asn Val 
Lys Gin Arg 
Pro Tyr Leu 
Gly Lys Ala 
Ser Val Val 
Phe Thr He 
He Tyr Thr 
Phe Pro Tyr 
Thr Lys Asp 
Tyr Asn Ser 
Thr Ala Ala 
Ala Leu Val 
Asp Ser Leu 
Asn Leu Leu 
Lys Asp Ser 



Trp Cys Leu 
Ala Val Ser 
His Ser Ser 
Ser Gin Arg 
Thr He Tyr 
Thr Thr Lys 
Leu He Arg 
Asn Glu Asp 
Val Ala His 
Arg Arg Ser 
Thr Glu Tyr 
His Ser Gin 
Phe Val Ala 
Met Pro Asp 
Lys Asp Gin 
Cys Asn Leu 
Pro Tyr His 
Pro Phe Tyr 
Ala Asp Lys 
Gly Arg Pro 
Glu Arg Ala 
Val Thr Cys 
Ser Glu Ala 
Thr Phe Leu 
Asp Cys Val 
Thr Ser Tyr 
Glu Thr Thr 
Ser Leu Val 
Thr His Asn 
Leu Ser Asn 
Phe Thr Tyr 
He Ala Glu 
Lys Glu Leu 
Asn Lys Pro 
Ser Cys Val 
Asn Val Lys 
Phe Asn Phe 
Asp Asn Glu 
Pro Ser Leu 
Asp Tyr Leu 
Asp Ser Met 
Arg Val Leu 
Phe Asp Leu 
Val Lys Tyr 
Lys Gly Leu 
Val Gly Val 
Glu Gly Val 
He Leu Val 
Arg Gin Arg 
Leu Val Ser 
Thr Ser Leu 
Gly Arg Lys 
Pro Pro Tyr 
Arg Leu Asp 
Asp Gly Gin 
Asp Arg Leu 
Asp Glu Glu 



Val Val Cys 
Ser Ser Ser 
his Thr Thr 
Val Thr Ser 
Asn Thr Thr 
Tyr Pro Tyr 
Phe Glu Arg 
Leu Asp Glu 
Thr Phe Lys 
Tyr Ala Tyr 
Val Ala Pro 
Cys Tyr Ser 
Tyr His Arg 
Asp Tyr Ser 
Trp His Ser 
Asn Cys Met 
Phe Phe Ala 
Asn Gly Thr 
Phe Phe He 
Asn Ser Ala 
Asp Ser Val 
Gin Leu Thr 
Glu Asp Ser 
Ser Lys Lys 
Arg Asp Glu 
Asn Gin Thr 
Gly Gly Leu 
Glu Leu Glu 
Arg Thr Lys 
Met Glu Ser 
Asp Thr Leu 
Ala Trp Cys 
Ser Lys He 
He Ala Ala 
Thr He Asn 
Glu Ser Pro 
Ala Asn Ser 
He Leu Leu 
Lys He Phe 
Phe Lys Arg 
He Ala Leu 
Glu Leu Tyr 
Glu Glu He 
Val Glu Asp 
Asp Asp Leu 
Ala He Gly 
Ala Thr Phe 
Ala He Ala 
Arg Leu Cys 
Ala Asp Gly 
Gin Ala Pro 
Gly Pro Gly 
Thr Asn Glu 
Ala Glu Gin 
Thr Gly Thr 
Arg His Arg 
Glu Asn Val 



Val Asn 
Thr Arg 
Ser Ala 
Ser Gin 
Leu Lys 
Arg Val 
Asn He 
Gly lie 
Val Arg 
He His 
Pro Met 
Ser Tyr 
Asp Ser 
Asn Thr 
Arg Gly 
Val Thr 
Thr Ser 
Asn Arg 
Phe Pro 
Leu Glu 
He Ser 
Phe Trp 
Tyr His 
Gin Glu 
Ala He 
Tyr Glu 
Val Val 
Arg Leu 
Arg Ser 
Val His 
Arg Gly 
Val Asp 
Asn Pro 
Arg Phe 
Gin Thr 
Gly Arg 
Ser Tyr 
Gly Asn 
He Ala 
Met He 
Asp He 
Ser Gin 
Met Arg 
Lys Val 
Met Ser 
Ala Val 
Leu Lys 
Val Val 
Met Gin 
Thr Thr 
Pro Ser 
Pro Pro 
Gin Ala 
Arg Ala 
Gin Asp 
Lys Asn 



Leu Cys He 
Gly Thr Ser 
Ala His Ser 
Thr Val Ser 
Tyr Gly Asp 
Cys Ser Met 
Val Cys Thr 
Met Val Val 
Val Tyr Gin 
Thr Thr Tyr 
Trp Glu He 
Ser Arg Val 
Tyr Glu Asn 
His Ser Thr 
Ser Thr Trp 
He Thr Thr 
Thr Gly Asp 
Asn Ala Ser 
Asn Tyr Thr 
Thr His Arg 
Trp Asp He 
Glu Ala Ser 
Phe Ser Ser 
Val Asn Met 
Asn Lys Leu 
Lys Tyr Gly 
Phe Trp Gin 
Ala Asn Arg 
Thr Asp Gly 
Asn Leu Val 
Tyr He Asn 
Gin Arg Arg 
Ser Ala He 
Met Gly Asp 
Ser Val Lys 
Cys Tyr Ser 
Val Gin Tyr 
His Arg Thr 
Gly Asn Ser 
Asp Leu Ser 
Asp Pro Leu 
Lys Glu Leu 
Glu Phe Asn 
Val Asp Pro 
Gly Leu Gly 
Gly Gly Ala 
Asn Pro Phe 
He He He 
Pro Leu Gin 
Val Thr Ser 
Tyr Glu Glu 
Ser Ser Asp 
Tyr Gin Met 
Gin Gin Asn 
Lys Gly Gin 
Gly Tyr Arg 



60 SEQ ID N0:20: 

Met Ser Lys Met Val Val Leu Phe Leu Ala Val Phe Leu Met Asn Ser 
Val Leu Met He Tyr Cys Asp Pro Asp His Tyr He Arg Ala Gly Tyr 
Asn His Lys Tyr Pro Phe Arg He Cys Ser He Ala Lys Gly Thr Asp 
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Leu Met Arg Phe Asp Arg Asp He Ser Cys Ser Pro Tyr Lys Ser Asn 
Ala Lys Net Ser Glu Gly Phe Phe He He Tyr Lys Thr Asn He Glu 
Thr Tyr Thr Phe Pro Val Arg Thr Tyr Lys Lys Glu Leu Thr Phe Gin 
Ser Ser Tyr Arg Asp Val Gly Val Val Tyr Phe Leu Asp Arg Thr Val 
5 Met Gly Leu Ala Met Pro Val Tyr Glu Ala Asn Leu Val Asn Ser His 

Ala Gin Cys Tyr Ser Ala Val Ala Het Lys Arg Pro Asp Gly Thr Val 
Phe Ser Ala Phe His Glu Asp Asn Asn Lys Asn Asn Thr Leu Asn Leu 
Phe Pro Leu Asn Phe Lys Ser He Thr Asn Lys Arg Phe He Thr Thr 
Lys Glu Pro Tyr Phe Ala Arg Gly Pro Leu Trp Leu Tyr Ser Thr Ser 

10 Thr Ser Leu Asn Cys He Val Thr Glu Ala Thr Ala Lys Ala Lys Tyr 

Pro Phe Ser Tyr Phe Ala Leu Thr Thr Gly Glu He Val Glu Gly Ser 
Pro Phe Phe Asn Gly Ser Asn Gly Lys His Phe Ala Glu Pro Leu Glu 
Lys Leu Thr He Leu Glu Asn Tyr Thr Met lie Glu Asp Leu Met Asn 
Gly Met Asn Gly Ala Thr Thr Leu Val Arg Lys He Ala Phe Leu Glu 

15 Lys Ala Asp Thr Leu Phe Ser Trp Glu He Lys Glu Glu Asn Glu Ser 

Val Cys Met Leu Lys His Trp Thr Thr Val Thr His Gly Leu Arg Ala 
Glu Thr Asp Glu Thr Tyr His Phe He Ser Lys Glu Leu Thr Ala Ala 
Phe Val Ala Pro Lys Glu Ser Leu Asn Leu Thr Asp Pro l^ys Gin Thr 
Cys He Lys Asp Glu Phe Glu Lys He He Asn Glu Val Tyr Het Ser 

20 Asp Tyr Asn Asp Thr Tyr Ser Met Asn Gly Ser Tyr Gin He Phe Lys 

Thr Thr Gly Asp Leu lie Leu He Trp Gin Pro Leu Val Gin Lys Ser 
Leu Met Phe Leu Glu Gin Gly Ser Glu Lys lie Arg Arg Arg Arg Asp 
Val Val Asp Val Lys Ser Arg His Asp He Leu Tyr Val Gin Leu Gin 
Tyr Leu Tyr Asp Thr Leu Lys Asp Tyr He Asn Asp Ala Leu Gly Asn 

25 Leu Ala Glu Ser Trp Cys Leu Asp Gin Lys Arg Thr He Thr Het Leu 

His Glu Leu Ser Lys He Ser Pro Ser Ser He Val Ser Glu Val Tyr 
Gly Arg Pro He Ser Ala Gin Leu his Gly Asp Val Leu Ala He Ser 
Lys Cys He Glu Val Asn Gin Ser Ser Val Gin Leu His Lys Ser Het 
Arg Val Val Asp Ala Lys Gly Val Arg Ser Glu Thr Het Cys Tyr Asn 

30 Arg Pro Leu Val Thr Phe Ser Phe Val Asn Ser Thr Pro Glu Val Val 

Pro Gly Gin Leu Gly Leu Asp Asn Glu He Leu Leu Gly Asp His Arg 
Thr Glu Glu Cys Glu He Pro Ser Thr Lys He Phe Leu Ser Gly Asn 
His Ala His Val Tyr Thr Asp Tyr Thr His Thr Asn Ser Thr Pro He 
Glu Asp He Glu Val Leu Asp Ala Phe lie Arg Leu Lys He Asp Pro 

35 Leu Glu Asn Ala Asp Phe Lys Val Leu Asp Leu Tyr Ser Pro Asp Glu 

Leu Ser Arg Ala Asn Val Phe Asp Leu Glu Asn He Leu Arg Glu Tyr 
Asn Ser Tyr Lys Ser Ala Leu Tyr Thr He Glu Ala Lys He Ala Thr 
Asn Thr Pro Ser Tyr Val Asn Gly He Asn Ser Phe Leu Gin Gly Leu 
Gly Ala He Gly Thr Gly Leu Gly Ser Val He Ser Val Thr Ala Gly 

40 Ala Leu Gly Asp He Val Gly Gly Val Val Ser Phe Leu Lys Asn Pro 

Phe Gly Gly Gly Leu Met Leu lie Leu Ala He Val Val Val Val He 
He He Val Val Phe Val Arg Gin Arg His Val Leu Ser Lys Pro He 
Asp Met Met Phe Pro Tyr Ala Thr Asn Pro Val Thr Thr Val Ser Ser 
Val Thr Gly Thr Thr Val Val Lys Thr Pro Ser Val Lys Asp Val Asp 

45 Gly Gly Thr Ser Val Ala Val Ser Glu Lys Glu Glu Gly Met Ala Asp 

Val Ser Gly Gin Val Ser Asp Asp Glu Tyr Ser Gin Glu Ala Ala Leu 
Lys Het Leu Lys Ala He Lys Ser Leu Asp Glu Ser Tyr Arg Arg Lys 
Pro Ser Ser Ser Glu Ser His Ala Ser Lys Pro Ser Leu He Asp Arg 
He Arg Tyr Arg Gly Tyr Lys Ser Val Asn Val Glu Glu Ala 

50 

SEQ ID NO: 21: 

Met Phe Val Thr Ala Val Val Ser Val Ser Pro Ser Ser Phe Tyr Glu 
Ser Leu Gin Val 61u Pro Thr Gin Ser Glu Asp He Thr Arg Ser Ala 
His Leu Gly Asp Gly Asp Glu lie Arg Glu Ala He His Lys Ser Gin 

55 Asp Ala Glu Thr Lys Pro Thr Phe Tyr Val Cys Pro Pro Pro Thr Gly 

Ser Thr He Val Arg Leu Glu Pro Thr Arg Thr Cys Pro Asp Tyr His 
Leu Gly Lys Asn Phe Thr Glu Gly He Ala Val Val Tyr Lys Glu Asn 
He Ala Ala Tyr Lys Phe Lys Ala Thr Val Tyr Tyr Lys Asp Val He 
Val Ser Thr Ala Trp Ala Gly Ser Ser Tyr Thr Gin He Thr Asn Arg 

60 Tyr Ala Asp Arg Val Pro He Pro Val Ser Glu He Thr Asp Thr He 

Asp Lys Phe Gly Lys Cys Ser Ser Lys Ala Thr Tyr Val Arg Asn Asn 
His Lys Val Glu Ala Phe Asn Glu Asp Lys Asn Pro Gin Asp Het Pro 
Leu He Ala Ser Lys Tyr Asn Ser Val Gly Ser Lys Ala Trp His Thr 
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Thr Asn Asp Thr Tyr Met Val Ala Gly Thr Pro Gly Thr Tyr Arg Thr 
Gly Thr Ser Val Asn Cys He He Glu Glu Val Glu Ala Arg Ser He 
Phe Pro Tyr Asp Ser Phe Gly Leu Ser Thr Gly Asp He He Tyr Met 
Ser Pro Phe Phe Gly Leu Arg Asp Gly Ala Tyr Arg Glu His Ser Asn 
5 Tyr Ala Met Asp Arg Phe His Gin Phe Glu Gly Tyr Arg Gin Arg Asp 

Leu Asp Thr Arg Ala Leu Leu Glu Pro Ala Ala Arg Asn Phe Leu Val 
Thr Pro His Leu Thr Val Gly Trp Asn Trp Lys Pro Lys Arg Thr Glu 
Val Cys Ser Leu Val Lys Trp Arg Glu Val Glu Asp Val Val Arg Asp 
Glu Tyr Ala His Asn Phe Arg Phe Thr Met Lys Thr Leu Ser Thr Thr 
10 Phe He Ser Glu Thr Asn Glu Phe Asn Leu Asn Gin He His Leu Ser 

Gin Cys Val Lys Glu Glu Ala Arg Ala He He Asn Arg lie Tyr Thr 
Thr Arg Tyr Asn Ser Ser His Val Arg Thr Gly Asp He Gin Thr Tyr 
Leu Ala Arg Gly Gly Phe Val Val Val Phe Gin Pro Leu Leu Ser Asn 
Ser Leu Ala Arg Leu Tyr Leu Gin Glu Leu Val Arg Glu Asn Thr Asn 
15 His Ser Pro Gin Lys His Pro Thr Arg Asn Thr Arg Ser Arg Arg Ser 

Val Pro Val Glu Leu Arg Ala Asn Arg Thr He Thr Thr Thr Ser Ser 
Val Glu Phe Ala Met Leu Gin Phe Thr Tyr Asp His lie Gin Glu His 
Val Asn Glu Met Leu Ala Arg He Ser Ser Ser Trp Cys Gin Leu Gin 
Asn Arg Glu Arg Ala Leu Trp Ser Gly Leu Phe Pro He Asn Pro Ser 
20 Ala Leu Ala Ser Thr lie Leu Asp Gin Arg Val Lys Ala Arg He Leu 

Gly Asp Val lie Ser Val Ser Asn Cys Pro Glu Leu Gly Ser Asp Thr 
Arg lie He Leu Gin Asn Ser Met Arg Val Ser Gly Ser Thr Thr Arg 
Cys Tyr Ser Arg Pro Leu He Ser He Val Ser Leu Asn Gly Ser Gly 
Thr Val Glu Gly Gin Leu Gly Thr Asp Asn Glu Leu lie Met Ser Arg 
25 Asp Leu Leu Glu Pro Cys Val Ala Asn His Lys Arg Tyr Phe Leu Phe 

Gly His His Tyr Val Tyr Tyr Glu Asp Tyr Arg Tyr Val Arg Glu He 
Ala Val His Asp Val Gly Met He Ser Thr Tyr Val Asp Leu Asn Leu 
Thr Leu Leu Lys Asp Arg Glu Phe Met Pro Leu Gin Val Tyr Thr Arg 
Asp Glu Leu Arg Asp Thr Gly Leu Leu Asp Tyr Ser Glu lie Gin Arg 
30 Arg Asn Gin Met His Ser Leu Arg Phe Tyr Asp He Asp Lys Val Val 

Gin Tyr Asp Ser Gly Thr Ala He Met Gin Gly Met Ala Gin Phe Phe 
Gin Gly Leu Gly Thr Ala Gly Gin Ala Val Gly His Val Val Leu Gly 
Ala Thr Gly Ala Leu Leu Ser Thr Val His Gly Phe Thr Thr Phe Leu 
Ser Asn Pro Phe Gly Ala Leu Ala Val Gly Leu Leu Val Leu Ala Gly 
35 Leu Val Ala Ala Phe Phe Ala Tyr Arg Tyr Val Leu Lys Leu Lys Thr 

Ser Pro Met Lys Ala Leu Tyr Pro Leu Thr Thr Lys Gly Leu Lys Gin 
Leu Pro Glu Gly Met Asp Pro Phe Ala Glu Lys Pro Asn Ala Thr Asp 
Thr Pro He Glu Glu He Gly Asp Ser Gin Asn Thr Glu Pro Ser Val 
Asn Ser Gly Phe Asp Pro Asp Lys Phe Arg Glu Ala Gin Glu Met He 
40 Lys Tyr Met Thr Leu Val Ser Ala Ala Glu Arg Gin Glu Ser Lys Ala 

Arg Lys Lys Asn Lys Thr Ser Ala Leu Leu Thr Ser Arg Leu Thr Gly 
Leu Ala Leu Arg Asn Arg Arg Gly Tyr Ser Arg Val Arg Thr Glu Asn 
Val Thr Gly Val 

ID N0:22: 

Met Arg Gin Gly Ala Ala Arg Gly Cys Arg Trp Phe Val Val Trp Ala 
Leu Leu Gly Leu Thr Leu Gly Val Leu Val Ala Ser Ala Ala Pro Ser 
Ser Pro Gly Thr Pro Gly Val Ala Ala Ala Thr Gin Ala Ala Asn Gly 
Gly Pro Ala Thr Pro Ala Pro Pro Ala Pro 61y Pro Ala Pro Thr Gly 
Asp Thr Lys Pro Lys Lys Asn Lys Lys Pro Lys Asn Pro Pro Pro Pro 
Arg Pro Ala Gly Asp Asn Ala Thr Val Ala Ala Gly His Ala Thr Leu 
Arg Glu His Leu Arg Asp He Lys Ala Glu Asn Thr Asp Ala Asn Phe 
Tyr Val Cys Pro Pro Pro Thr Gly Ala Thr Val Val Gin Phe Glu Gin 
Pro Arg Arg Cys Pro Thr Arg Pro Glu Gly Gin Asn Tyr Tfcr Glu Gly 
He Ala Val Val Phe Lys Glu Asn He Ala Pro Tyr Lys Phe Lys Ala 
Thr Met Tyr Tyr Lys Asp Val Thr Val Ser Gin Val Trp Phe Gly His 
Arg Tyr Ser Gin Phe Met Gly lie Phe Glu Asp Arg Ala Pro Val Pro 
Phe Glu Glu Val lie Asp Lys He Asn Ala Lys Gly Val Cys Arg Ser 
Thr Ala Lys Tyr Val Arg Asn Asn Leu Glu Thr Thr Ala Phe His Arg 
Asp Asp His Glu Thr Asp Met Glu Leu Lys Pro Ala Asn Ala Ala Thr 
Arg Thr Ser Arg Gly Trp His Thr Thr Asp Leu Lys Tyr Asn Pro Ser 
Arg Val Glu Ala Phe His Arg Tyr Gly Thr Thr Val Asn Cys He Val 
Glu Glu Val Asp Ala Arg Ser Val Tyr Pro Tyr Asp Glu Phe Val Leu 



45 SEQ 
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Ala Thr Gly Asp Phe Val Tyr Met Ser Pro Phe Tyr Gly Tyr Arg Glu 
Gly Ser His Thr Glu His Thr Ser Tyr Ala Ala Asp Arg Phe Lys Gin 
Val Asp Gly Phe Tyr Ala Arg Asp Leu Thr Thr Lys Ala Arg Ala Thr 
Ala Pro Thr Thr Arg Asn Leu Leu Thr Thr Pro Lys Phe Thr Val Ala 
5 Trp Asp Trp Val Pro Lys Arg Pro Ser Val Cys Thr Met Thr Lys Trp 

Gin Glu Val Asp Glu Met Leu Arg Ser Glu Tyr Gly Gly Ser Phe Arg 
Phe Ser Ser Asp Ala He Ser Thr Thr Phe Thr Thr Asn Leu Thr Glu 
Tyr Pro Leu Ser Arg Val Asp Leu Gly Asp Cys He Gly Lys Asp Ala 
Arg Asp Ala Met Asp Arg He Phe Ala Arg Arg Tyr Asn Ala Thr His 

10 lie Lys Val Gly Gin Pro Gin Tyr Tyr Leu Ala Asn Gly Gly Phe Leu 

He Ala Tyr Gin Pro Leu Leu Ser Asn Thr Leu Ala Glu Leu Tyr Val 
Arg Glu His Leu Arg Glu Gin Ser Arg Lys Pro Pro Asn Pro Thr Pro 
Pro Pro Pro Gly Ala Ser Ala Asn Ala Ser Val Glu Arg He Lys Thr 
Thr Ser Ser He Glu Phe Ala Arg Leu Gin Phe Thr Tyr Asn His He 

15 Gin Arg His Val Asn Asp Net Leu Gly Arg Val Ala He Ala Trp Cys 

Glu Leu Gin Asn His Glu Leu Thr Leu Trp Asn Glu Ala Arg Lys Leu 
Asn Pro Asn Ala He Ala Ser Ala Thr Val Gly Arg Arg Val Ser Ala 
Arg Net Leu Gly Asp Val Net Ala Val Ser Thr Cys Val Pro Val Ala 
Ala Asp Asn Val He Val Gin Asn Ser Net Arg He Ser Ser Arg Pro 

20 Gly Ala Cys Tyr Ser Arg Pro Lai Val Ser Phe Arg Tyr Glu Asp Gin 

Gly Pro Leu Val Glu Gly Gin Val Gly Glu Asn Asn Glu Leu Arg Leu 
Thr Arg Asp Ala He Glu Pro Cys Thr Val Gly His Arg Arg Tyr Phe 
Thr Phe Gly Gly Gly Tyr Val Tyr Phe Glu Glu Tyr Ala Tyr Ser His 
Gin Leu Ser Arg Ala Asp He Thr Thr Val Ser Thr Phe He Asp Leu 

25 Asn He Thr Net Leu Glu Asp His Glu Phe Val Pro Leu Glu Val Tyr 

Thr Arg His Glu He Lys Asp Ser Gly Leu Leu Asp Tyr Thr Glu Val 
Gin Arg Arg Asn Gin Leu His Asp Leu Arg Phe Ala Asp He Asp Thr 
Val He His Ala Asp Ala Asn Ala Ala Net Phe Ala Gly Leu Gly Ala 
Phe Phe Glu Gly Net Gly Asp Leu Gly Arg Ala Val Gly Lys Val Val 

30 Net Gly He Val Gly Gly Val Val Ser Ala Val Ser Gly Val Ser Ser 

Phe Net Ser Asn Pro Phe Gly Ala Leu Ala Val Gly Leu Leu Val Leu 
Ala Gly Leu Ala Ala Ala Phe Phe Ala Phe Arg Tyr Val Net Arg Leu 
Gin Ser Asn Pro Net Lys Ala Leu Tyr Pro Leu Thr Thr Lys Glu Leu 
Lys Asn Pro Thr Asn Pro Asp Ala Ser Gly Glu Gly Glu Glu Gly Gly 

35 Asp Phe Asp Glu Ala Lys Leu Ala Glu Ala Arg Glu Net He Arg Tyr 

Net Ala Leu Val Ser Ala Net Glu Arg Thr Glu His Lys Ala Lys Lys 
Lys Gly Thr Ser Ala Leu Leu Ser Ala Lys Val Thr Asp Net Val Net 
Arg Lys Arg Arg Asn Thr Asn Tyr Thr Gin Val Pro Asn Lys Asp Gly 
Asp Ala Asp Glu Asp Asp Leu 

40 

SEQ 10 NO: 23: 

Net Arg Pro Arg Gly Thr Pro Pro Ser Phe Leu Pro Leu Pro Val Leu 
Leu Ala Leu Ala Val He Ala Ala Ala Gly Arg Ala Ala Pro Ala Ala 
Ala Ala Ala Pro Thr Ala Asp Pro Ala Ala Thr Pro Ala Leu Pro Glu 

45 Asp Glu Glu Val Pro Asp Glu Asp Gly Glu Gly Val Ala Thr Pro Ala 

Pro Ala Ala Asn Ala Ser Val Glu Ala Gly Arg Ala Thr Leu Arg Glu 
Asp Leu Arg Glu He Lys Ala Arg Asp Gly Asp Ala Thr Phe Tyr Val 
Cys Pro Pro Pro Thr Gly Ala Thr Val Val Gin Phe Glu Gin Pro Arg 
Pro Cys Pro Arg Ala Pro Asp Gly Gin Asn Tyr Thr Glu Gly He Ala 

50 Val Val Phe Lys Glu Asn He Ala Pro Tyr Lys Phe Lys Ala Thr Net 

Tyr Tyr Lys Asp Val Thr Val Ser Gin Val Trp Phe Gly His Arg Tyr 
Ser Gin Phe Net Gly He Phe 61 u Asp Arg Ala Pro Val Pro Phe Glu 
Glu Val Net Asp Lys He Asn Ala Lys Gly Val Cys Arg Ser Thr Ala 
Lys Tyr Val Arg Asn Asn Net Glu Ser Thr Ala Phe His Arg Asp Asp 

55 His Glu Ser Asp Net Ala Leu Lys Pro Ala Lys Ala Ala Thr Arg Thr 

Ser Arg Gly Trp his Thr Thr Asp Leu Lys Tyr Asn Pro Ala Arg Val 
Glu Ala Phe His Arg Tyr Gly Thr Thr Val Asn Cys He Val Glu Glu 
Val Glu Ala Arg Ser Val Tyr Pro Tyr Asp Glu Phe Val Leu Ala Thr 
Gly Asp Phe Val Tyr Net Ser Pro Phe Tyr Gly Tyr Arg Asp Gly Ser 

60 His Gly Glu His Thr Ala Tyr Ala Ala Asp Arg Phe Arg Gin Val Asp 

Gly Tyr Tyr Glu Arg Asp Leu Ser Thr Gly Arg Arg Ala Ala Ala Pro 
Val Thr Arg Asn Leu Leu Thr Thr Pro Lys Phe Thr Val Gly Trp Asp 
Trp Ala Pro Lys Arg Pro Ser Val Cys Thr Leu Thr Lys Trp Arg Glu 

)03 
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Val Asp Glu Met Leu Arg Ala 61u Tyr 61y Pro Ser Phe Arg Phe Ser 
Set Ala Ala Leu Ser Thr Thr Phe Thr Ala Asn Arg Thr Glu Tyr Ala 
Leu Ser Arg Val Asp Leu Ala Asp Cys Val Gly Arg Glu Ala Arg Glu 
Ala Val Asp Arg lie Phe Leu Arg Arg Tyr Asn Gly Thr His Val Lys 
5 Val Gly Gin Val Gin Tyr Tyr Leu Ala Thr Gly Gly Phe Leu He Ala 

Tyr Gin Pro Leu Leu Ser Asn Ala Leu Val Glu Leu Tyr Val Arg Glu 
Leu Val Arg Glu Gin Thr Arg Arg Pro Ala Gly Gly Asp Pro Gly Glu 
Ala Ala Thr Pro Gly Pro Ser Val Asp Pro Pro Ser Val Glu Arg He 
Lys Thr Thr Ser Ser Val Glu Phe Ala Arg Leu Gin Phe Thr Tyr Asp 

10 His He Gin Arg His Val Asn Asp Met Leu Gly Arg lie Ala Thr Ala 

Trp Cys Glu Leu Gin Asn Arg Glu Leu Thr Leu Trp Asn Glu Ala Arg 
Arg Leu Asn Pro Gly Ala He Ala Ser Ala Thr Val Gly Arg Arg Val 
Ser Ala Arg Met Leu Gly Asp Val Met Ala Val Ser Thr Cys Val Pro 
Val Ala Pro Asp Asn Val lie Met Gin Asn Ser He Gly Val Ala Ala 

15 Arg Pro Gly Thr Cys Tyr Ser Arg Pro Leu Val Ser Phe Arg Tyr Glu 

Ala Asp Gly Pro Leu Val Glu Gly Gin Leu Gly Glu Asp Asn Glu He 
Arg Leu Glu Arg Asp Ala Leu Glu Pro Cys Thr Val Gly His Arg Arg 
Tyr Phe Thr Phe Gly Ala Gly Tyr Val Tyr Phe Glu Glu Tyr Ala Tyr 
Ser His Gin Leu Gly Arg Ala Asp Val Thr Thr Val Ser Thr Phe He 

20 Asn Leu Asn Leu Thr Met Leu Glu Asp His Glu Phe Val Pro Leu Glu 

Val Tyr Thr Arg Gin Glu He Lys Asp Ser Gly Leu Leu Asp Tyr Thr 
Glu Val Gin Arg Arg Asn Gin Leu His Ala Leu Arg Phe Ala Asp He 
Asp Thr Val He Lys Ala Asp Ala His Ala Ala Leu Phe Ala Gly Leu 
Tyr Ser Phe Phe Glu Gly Leu Gly Asp Val Gly Arg Ala Val Gly Lys 

25 Val Val Met Gly He Val Gly Gly Val Val Ser Ala Val Ser Gly Val 

Ser Ser Phe Leu Ser Asn Pro Phe Gly Ala Leu Ala Val Gly Leu Leu 
Val Leu Ala Gly Leu Ala Ala Ala Phe Phe Ala Phe Arg Tyr Val Met 
Arg Leu Gin Arg Asn Pro Met Lys Ala Leu Tyr Pro Leu Thr Thr Lys 
Glu Leu Lys Ser Asp Gly Ala Pro Leu Ala Gly Gly Gly Glu Asp Gly 

30 Ala Glu Asp Phe Asp Glu Ala Lys Leu Ala Gin Ala Arg Glu Met He 

Arg Tyr Met Ala Leu Val Ser Ala Met Glu Arg Thr Glu His Lys Ala 
Arg Lys Lys Gly Thr Ser Ala Leu Leu Ser Ala Lys Val Thr Asp Ala 
Val Met Arg Lys Arg Ala Arg Pro Arg Tyr Ser Pro Leu Arg Asp Thr 
Asp Glu Glu Glu Leu 

35 

SEQ ID N0:24: 

GCTGTTCAGA TTTGACTTAG AYMANMCMTG YCC 33 



SEQ ID N0:25: 

40 GTCTACAAGA AGAACATCGT GCCNTAYATN TTYAA , 35 
SEQ ID NO: 26: 

GTCTACAAGA AGAACATCGT GCC 23 

45 SEQ ID N0 :27: 

AACATCTCTA CAATCTCACA RTTOACNGTW GT 32 



SEQ ID N0:2S: 

AACATGTCTA CAATCTCACA 20 

50 

SEQ ID N0:29: 

AATAACCTCT TTACGGCCCA AATTCARTWY GCNTAYGA 38 
SEQ ID N0:30: 

55 CCAACGAGTG TGATGTCAGC CATTTAYGGN AARCCNGT 38 



SEQ ID NO: 31: 

CCAACGAGTG TGATGTCAGC C 21 

60 SEQ ID N0:32: 

TGCTACTCGC GACCTCTAGT CACCTTYAAR TTYRTNAA 38 

SEQ ID N0:33: 

I0H 
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TCCTACTCGC GACCTCTAGT CACC 24 
SEQ ID N0:34: 

ACCGGMGTAC AGTTCCACTC TYTTOAARTC QATRTT 36 

5 

SEQ ID N0:35: 

TGTCACCTTC ACATGAGGCC A 21 
SEQ ID N0:36: 

10 TTTGACCTGG AGACTATCTT YMGNGARTAY M 32 

SEQ ID N0:37: 

GCT C TGGGTG TAGTAGTTRT AYTCYCTPAA CAT 33 

15 SEQ ID N0:38: 

TCTCGGAACA TGCTCTCCAG RTCRAAMACR TT 32 



20 



SEQ ID NO: 39: 

ACCTTCATCA AAAATCCCTT NGGNGGNATG YT 32 
SEQ ID NO: 40: 

TGGACTTACA GGACTCGAAC NACNCTNAAY TG 32 
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Claims 

What is claimed is: 

1. An isolated polynucleotide with a region encoding a Glycoprotein B of a herpes virus, the 
polynucleotide comprising a sequence at least 65% identical to nucleotides 36 to 354 of SEQ. ID 
NO:1orSEQ.IDNO:3. 

2. An isolated polynucleotide comprising a fragment of at least 50 consecutive nucleotides of the 
Glycoprotein B encoding region of the polynucleotide of daim 1 . 

3. An isolated polynucleotide with a region encoding a Glycoprotein B of a herpes virus, the 
polynucleotide comprising a sequence selected from the group consisting of. a sequence of 35 
nucleotides at least 74% identical to oligonucleotide SHMDA (SEQ. ID NO:41); a sequence of 30 
nucleotides at least 73% identical to oligonucleotide CFSSB (SEQ. ID NO:43); a sequence of 29 
nucleotides at least 72% identical to oligonucleotide EMTFA (SEQ. ID NO:45); and a sequence of 
35 nucleotides at least 80% identical to oligonucleotide DNIQB (SEQ. ID NO:46). 

4. An isolated polynucleotide comprising a fragment of at least 50 consecutive nucleotides of the 
Glycoprotein B encoding region of the polynucleotide of claim 3. 

5. The polynucleotide of claim 1 or claim 2, wherein said herpes virus is capable of infecting 
primates. 

6. The polynucleotide of claim 1 or claim 2. wherein said herpes virus is RFHV1 , RFHV2, or KSHV. 

7. An isolated polynucleotide comprising a linear sequence of at least 21 nucleotides identical to a 
linear sequence between nucleotides 36 to 354 inclusive of SEQ. ID NOS:1 , 3, or 92 or anywhere 
within SEQ. ID NO:96; but not in SEQ. ID NO:98. 

8. The isolated polynucleotide of claim 7, comprising a linear sequence essentially identical to 
nucleotides 36 to 354 of SEQ. ID NO:1, SEQ. ID NO:3, or SEQ. ID NO:96, or at least 200 
consecutive nucleotides in SEQ. ID NO:92. 

9. An isolated polypeptide encoded by the polynucleotide of claim 2. 

10. An isolated polypeptide, comprising a linear sequence of at least 17 amino acids essentially 
identical to a sequence between amino acids 13 to 118 inclusive of SEQ. ID NO S:2, 4, or 97 or 
anywhere within SEQ. ID NO:94; but not in SEQ. ID NO:99. 

1^ 
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1 1 . The isolated polypeptide of claim 1 0, consisting essentially of a polypeptide having the sequence 
shown in SEQ. ID NO:2, SEQ. ID NO:4, SEQ. ID NO:94, or SEQ. ID NO:97 

12. A fusion polypeptide comprising a linear sequence of at least 17 amino acids identical to that 
comprised in the polypeptide according to claim 10. 

13. The isolated polypeptide of daim 10, implicated in the binding or fusion of herpes virus with a 
mammalian cell. 

14. The isolated polypeptide of claim 1 0, which is glycosylated. 

1 5. The isolated polypeptide of claim 1 0, which is not glycosylated. 

16. The isolated polypeptide of claim 10, which is immunogenic. 

17. The isolated polypeptide of claim 10, comprising a sequence selected from the group consisting 
of SEQ. IDNOS:67-76. 

18. An isolated polynucleotide encoding the polypeptide of daim 1 0. 

19. A norvnaturally occurring polynucleotide encoding the polypeptide of claim 1 0. 

20. A polynucleotide encoding a fusion polypeptide, comprising the polynucleotide of claim 2 joined 
directly to a second polynucleotide encoding a polypeptide. 

21. A recombinant cloning vector comprising a polynucleotide sequence encoding a linear sequence 
of at least 1 7 amino acids identical to that comprised in the polypeptide according to claim 1 0. 

22. A recombinant expression vector comprising a polynucleotide sequence encoding a linear 
sequence of at least 17 amino acids identical to that comprised in the polypeptide according to 
claim 10, operatively finked to a control polynucleotide sequence. 

23. A recombinant cloning vector comprising a linear sequence of at least 21 nucleotides identical to 
a linear sequence comprised in the polynucleotide of claim 7. 

24. A host cell transformed by the polynucleotide of claim 1 8 or claim 1 9, or by the vector of claim 21 , 
claim 22, or claim 23. 
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25. A monoclonal or isolated polyclonal antibody specific for a Glycoprotein B polypeptide encoded in 
said encoding region of the polynucleotide of claim 1 . 

26. A monoclonal or isolated polyclonal antibody specific for the polypeptide of claim 1 1 , but not for a 
polypeptide having an amino acid sequence of any of SEQ. ID NOS:30-41 .. 

27. The antibody of claim 26, which is a monoclonal antibody. 

28. The antibody of claim 26, which is an isolated polyclonal antibody. 

29. A vaccine comprising the polypeptide of claim 9 or claim 10 in a pharmaceutical^ compatible 
excipient. 

30. The vaccine of claim 29, also comprising an adjuvant 

31 . A method of treating a herpes virus infection, comprising administering the vaccine of claim 29. 

32. A vaccine comprising the polynucleotide of claim 2 or claim 7 in a pharmaceutical^ compatible 
excipient 

33. The vaccine of claim 32, which is a live vims or viral expression vector. 

34. A method of treating a herpes vims infection, comprising administering the vaccine of claim 32. 

35. A vaccine comprising the antibody of claim 25 or claim 26 in a pharmaceutical^ compatible 
excipient 

36. A method of treating a herpes vims infection, comprising administering the vaccine of claim 35. 

37. An oligonucleotide essentially identical to an oligonucleotide selected from the group consisting of 
SEQ. ID NOS24-63, SEQ. ID NOS:77-78, and SEQ. ID NOS:80-90. 

38. A method of obtaining an amplified copy of a polynucleotide encoding a Glycoprotein B, 
comprising the steps of. 

a) contacting the polynucleotide with the oligonucleotideof claim 37; and 

b) elongating oligonucleotide that has formed a duplex with the polynucleotide. 

39. The method of daim 38, wherein said amplification reaction is a polymerase chain reaction 
(PCR). 
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40. The method of claim 39, wherein said PGR comprises repeated cycles of annealing and 
elongating, and the annealing is conducted at a temperature of at least 500C. 

5 41. The method of claim 38, wherein the polynucleotide which is amplified is first obtained from a 
biological sample taken from an individual affected with a disease featuring fibroblast proliferation 
and collagen deposition. 

42. A method of detecting viral DNA or RNA in a sample, comprising the steps of: 

10 a) contacting the DNA or RNA in the sample with a probe comprising the polynucleotide of 

claim 2 or claim 7 under conditions that would permit the probe to form a stable duplex with 
at least one polynucleotide having a sequence shown in SEQ. ID NOS:1, 3, 92 or 94, but not 
with a polynucleotide having any of the sequences shown in SEQ. ID NOS:5-1 3; and 
b) detecting the presence of said stable duplex formed in step a), if any. 

15 

43. A method of detecting viral DNA or RNA in a sample, comprising the steps of 

a) contacting the DNA or RNA in the sample with a probe comprising the polynucleotide of 
daim 2 or claim 7 under conditions that would permit the probe to form a stable duplex with 
a polynucleotide having the sequence shown in SEQ. ID NO:1, and with a polynucleotide 

20 having the sequence shown in SEQ. ID NO:34, but not with a polynucleotide having any of 

the sequences shown in SEQ. ID NOS:5-13; and 

b) detecting the presence of said stable duplex formed in step a), if any. 

44. A method of detecting viral DNA or RNA in a sample of primate origin, comprising the steps of 

25 a) contacting the DNA or RNA in the sample with a probe comprising the polynucleotide of 

claim 2 or claim 7 under conditions that would permit the probe to form a stable duplex with 
at least one polynucleotide having a sequence shown in SEQ. ID NOS:1 , 3, 92 or 94. but not 
with a polynucleotide having any of the sequences shown in SEQ. ID NOS:5 or 9-13; and 
b) detecting the presence of said stable duplex formed in step a), if any. 

30 

45. The method of claim 42 further comprising conducting an amplification reaction on the DNA or 
RNA of the sample prior to being contacted with the probe. 

46. A method of detecting viral DNA or RNA in a sample, comprising the steps of 

35 a) conducting an amplification reaction on a polynucleotide in the sample using the 

oligonucleotide of daim 37 as a primer in the reaction; and 
b) detecting the presence of amplified copies of the polynucleotide, if any. 
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47. An isolated polynucleotide capable of forming a stable duplex with a second polypeptide 

comprising a sequence selected from the group consisting of SEQ. ID NOS:1, 3, 92, and 94, 
and their respective complementary sequences, under conditions wherein the second 
polynucleotide is capable of forming a stable duplex with at least one polynucleotide having 
a sequence shown in SEQ. ID NO:1, 3. 92 or 94, but not with a polynucleotide having a 
sequence of any of SEQ. ID NOS:5-13. 

48. The isolated polynucleotide of claim 47, the nucleotide sequence of which is contained in the 
genome of a naturally occurring vims. 

49. An isolated polypeptide comprising a linear sequence of at least 25 amino acids encoded within 
the polynucleotide of claim 47. 

50. A method for detecting infection of an individual by a herpes virus, comprising detecting viral DNA 
or RNA in a biological sample obtained from the individual, wherein the detecting of viral DNA or 
RNA is by the method of daim 42, daim 44, or daim 46. 

51 . A diagnostic kit for detecting a herpes virus polynudeotide in a biological sample, comprising a 
reagent in suitable packaging, wherein the reagent comprises the polynudeotide of daim 2or 
daim 7. 

52. A diagnostic kit for detecting a herpes virus polynudeotide in a biological sample, comprising a 
reagent in suitable packaging, wherein the reagent comprises the oligonucleotide of daim 37. 

53. A method of detecting Infection of an individual by a herpes virus, comprising the steps of. 

a) contacting antibody from a sample obtained from the individual with the polypeptide of daim 
25 or daim 26 under conditions that permit the formation of a stable antigen-antibody 
complex; and 

b) detecting said stable complexes formed in step a), if any. 

54. A diagnostic kit for detecting an anti-herpesvims antibody present in a biological sample, 
comprising a reagent in suitable packaging, wherein the reagent comprises the polypeptide of 
daim 25 or claim 26. 

55. A method of detecting infection of an individual by a herpes vims, comprising the steps of: 

a) contacting a polypeptide from a sample obtained from the individual with the antibody of 
daim 9 or claim 10 under conditions that permit the formation of a stable antigen-antibody 
complex; and 

b) detecting said stable complexes formed in step a), rf any. 



HO 



WO 97/12042 



PCT/US96/15702 



56. A diagnostic kit for detecting a herpes virus polypeptide present in a biological sample, 
comprising a reagent in suitable packaging, wherein the reagent comprises the antibody of claim 
9 or claim 10. 

57. A method of determining whether a pharmaceutical candidate is useful for treating gamma herpes 
infection, comprising the steps of 

a) contacting the polypeptide of claim 9 or claim 1 0 with the pharmaceutical candidate; and 

b) determining whether a biochemical function of the polypeptide is altered by the 
pharmaceutical candidate. 

58. The method of claim 57, wherein the biochemical function of the polypeptide determined in step 
b) is the binding of the polypeptide to the surface of a mammalian cell. 

59. A method of producing a Glycoprotein B polypeptide, comprising expressing the polynucleotide of 
claim 2 or claim 7 in a eukaryotic cell. 

60. A method of producing a polynucleotide comprising a region encoding herpes virus Glycoprotein 
B, comprising replicating the recombinant cloning vector of claim 21 or claim 23. 
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Figure 1(a) 



VYKKN I V P Y I F K V R R Y yt] K I A 
REHV GTGTACAAQAAQAACATCQTGCCQTACATTTTC^ ^ €0 

KSHV QTQTACAAGAAGAACATCQTQCCQTATATTTTTA;^^ 60 

VYKK HIVPYIFKVRRYg|}KIA 

(NIVPA> ) gtgtacaagaagaacat cgtgccntayatnttyaa 

(<GMTEB) gcgccatactgtcttcgtcgt 

(<AAITB) tcgtcgttagtgtttgtttat 
(GOTEA>) gcggtatgacagaagcagcaa 

(KYEIA> ) aacaaatatgagatc- 
TSVTVYRO[g|TE[g]AITNXYE[X] 
RFHV ACATCTSTCACGQTCTACCGCQQTATQACAGAAGCJU3CAAT^ 120 

KSHV ACCTCTGTCACGQTCTACAGGQGCTTGftCJU^ 120 

T SVTVY RO[gTEg)Al TNKY E [L] 

cgaactgtctcaggcggtagt (<GLTEB) 

(<TNKYB) ggtagtggttgttcatacttg 
gggcttgacagagtccgccat {GLTEA>) 

(YELPA>) acaagtatgaactc- 



-cccagg (KYEIA>) 
PRPVPLYE ISBMDSTYQCFS 
RFHV CCCAGGCCCQTQCC TC TC TA CGAGATCAGTCACATGQACATC 180 

KSHV CCQAOACCCGTQCCACTCTATGAGATAAGCCACATGGACAGCACCTATCAO 180 

PRPVPLYE ZSHMDSTYQCFS 

-ccgagac (YELPA>> 

agacccgtgccactctatgarathagycayatgga (SHMDA>) 

KCFSSB} acraartcr- 



(<TDRDB) atgcaagtggctagccctact 
SM Kpf|py|VMQVENTFTPRDDVW 
RFHV TCCATGAAAATTGTAQTQAACGQAQTCQAAAATACQCT 240 



KSHV TCCATGAAQQTAAATOTCAACQGQQTAQILAAACACATTTACTGACAGM 240 

SMK[||g|jVNGVENTFTDRDDVN 

cttccatttacagttgcccca (<VNVNB) 

(<TFTDB) gtgtaaatgactgtctctgct 
-agntacttctaacaacacttg (<CFSSB) 

(ENTFA>) gtcaacggagtagaxaayacnttyacnga 
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Figure Kb) 



(<VEGLBJ gcagcttccagattgactgtt 
(VEGIA>) cccgtcgaaggtctaactgac <PVLYA>) agc- 

HTVPLQPVBOIiTDHIQRT.FS 

* ** ***** ***** ** ** ** ** ** ** ** ***** ***** **** 

KSBV JtfXACAOTATTCCTCC^^ 300 

PR TVFLQPVEGLTDN I QRYFS 

ccacagtattcctccaaccag (TVFIA>) <SQPVA>) ggtactttagc- 

KDNIQB) tgnctrttrtaagtttccatgaaatcg- 



-caaccagtactgtactct (PVLYA>) 
QPV|E]Y[S]EPGWFPGIYRV[g)TT 

KFHV CAACCAOTACTQTACTCTQAACCCGGATGQTTC 360 

KSHV CAGCCGGTCATCTACGCGQAACCCOGCTGQTTTCCCGGCATATACAQAQTTAG 360 

Q p Vl 1 I E PGWFPG I Y R V|| T T 

-cagccggtca (SQPVA» «TVNCB) tgntgn- 

-gtcggtca (<DNIQB) 

VNCEIVDM 
RFHV OTAAACTGTGAGATTGTAGACATGTT 386 

KSHV GTCAACTGTGAQATTGTAOACATGTT 386 

VHCEIVDM 

-canttracactctaacatctgtacaa { <TVNCB) 
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Figure 2 



Glycoprotein B 
encoding region 



FRFDA NtVPA TVNCA FA YD A IYGKA CYSRA 



FREYA 



TVNCB 



NIDFB NVFDB FREYB 



-1700 bp 



-430 bp 



t -360 bp 



-600 bp ^_ 



-450 bp l_ 



-330 bp 



* -1600 bp 



-1060 bp 4 



f -1000 bp 
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Figure 3(a) 



KSHV 

HVS 

bHV4 

OHV68 

EBV 

hHV6 

hHVSl 



Signal -Peptide Domain 
HTPRSRCA 



MYYKTICE 
HYPTVKSH 
HTRRRW5 



KSHV 

HVS 

bHV4 

nHV68 

EBV 

hHV6 

hHVSl 



-^ssiafsar 




WEKNKTQAI - 

PSISSTTSSTTTSTSKPSNTTSTN- 



OXH1 ULYVCQI 



SAI3KVCSFJ 

VVVtDMlJOLGAQTPJSQI 
LAVF1MNSVLHIYCDPW YIRAGYN 
CRHFWWALLGLUGVL\ ftSAAPSSPGTPGVAAATQAANQGPATPAPPAPGPAPTGDTKPKKIK 



LYVCQPTTLRQPSDMTPAQDAPTETPPPLS7NTN - 
PAPPATTVQPTATRQ 



KSHV ASGPKSVDFYQFRVCSASIT-GELFRFNLEQTC 

HVS YQEYFKYRVCSASTT-GELRFDLDRTC 

bHV4 SSUWSPQNT5TSI<PSTDN(XnSTPTIPTV^ 

ntiV68 RJHLYVCQPTTLKQPSDMTPAQDAPTFTPPP^ 

EBV Ql^mLLAAlACRLGAQTPEQPAPPATrVQPTATRQQTSFPFRVCELSSH -GDLRFSSDKE 

hHV6 - - HKYPFRICSIAKG-TDLMRFDRDISC 

hHSVl KPKNPPPPI^AGDNATVAAGHATLREHLRDIKAENTDANFY- - -VCPPPTG-ATVVQFEQPRRC 



NIVPA> 

RFHVHn VRRYIKIATSVTVYRGM- -TEAAITOCYETPRPVPLY 

RFHVHm VRRYRKVATPVTLYRGM- -TDAAITNKYEIPRPVPLY 

KSHV PDTKDKY-HQEGILLVYKKNIVPHIFKVRRYRKIATSV7VYRGL- - TESAITNKYELPRPVPLY 

HVS PSTEDKV-HICEGIUVYKKNIVPYIFKVRRYIW 

bHV4 PDTKDKK - HVEG I LLV LKKN I VP Y I FX VRKYRKI ATS VTVYRGW - - SQAA VTNRDD I SRAI P YN 

rfW68 PSTQDKK-HVEGILLVYKINIVPYIFKIRRYRKIITQLTIWRGL- -TT5SVTGKFEHATQAHEW 

EBV PSFGmNHTEGLWVFXDNIIPYSFKVI^miVTNILIYNGW- - YADSVTNRHEEKFSVDSY 

hHV6 SPYKSNAKMSEGFFIIYKTNIETYTFPVRTYKKELTFQSSYRDV- - GW YFLDRTVHGLAMPVY 

hHSVl PTRPEGQNYTEGIAVVFXENIAPYKFKATMYYKDVTVSQVWFGH-RYSQFMGIFEDRAPVPFEE 



RFHVHn EISHMDSTYQCFSSHKIWNGVEMTFTORDDVNICTVFLQPVEGLTDNICKYFSQ- -PVLYSEPG 
RFHVHm EISHHDSTYttFSSHKI VVNGVEMTFTGROOVNKSVFLQPVEGLTDN1I(RYFSQ- - PVLYSEPG 
KSHV EISIWDSTySfSSHKVNVNGVENTFTDRDDVNTTVFLQPVEGLTDNIQRYFSQ- • PVIYAEPG 



HV1 EIDIHDKTY( 

bHV4 EISMIDRTY* 

mHV68 EVGDFDSIYC 

EBV ETDQMDTIYC 

hHV6 EANLVNSr " 

hHSVl VIDKINAKGV 



inchqievnghlnsyydrdgnnktvdlkpvogltgaitryisq- - pkvfadpg 
Ifsahatvimgilntyidrdsehksvplqpvaglteninryfsq-pliyaepg 
ynsatmwnnvrqvyvdrdgvnktvnirpvdgltgn1qryfsq--ptlysepg 
;ymavkmtxdgltrvyvdrdgvnitvnlxptgglangvrryasq - -TELYDAPG 
ysavamicrpdgtvfsafhednnknmtlnlfplnfksitnkrf ittkepyfarg 
takyvrhnlettaf^dhetdhelkpamaatrtsrgwh- -ttdlkyn-ps 
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Figure 3fb) 



RFHVNn UFPGIYRVG 
RFHVMm . WFPGI YRVR 

KSHV WFPGIYRVRTTVNCEI VDHIARSAEPYNYFVTSLGDTVEVSPFfiYNESS - 



•CSTTPSNKN-GLS 



HVS WLWGTYRTRTTVI 

bHV4 WFPGIYRVRTTVI 

HHV68 WMPGFYRVRTTVNj 

EBV WLIVfTY RTRTT Vl 

hHV6 P-LWLYSTSTSI 

hHSVl RVEAFHRYGTTVN 



;eivdmfarsadpytyfvtalgdtvevspfcdvons- -CPNAT- - • -DVLS 

lEWDMYARSVEPYTHFITALGDTIEISPFCH- -MNS(£nGMSTSRDA7K 
;EIVDHVARSHDPYNYIATAL(3)SLELSPFQ7FDNTSSTAPKRA- -DHRV 

LITOWmSNSPFDFFYn^TYEMSPFYDO^KETFHE RAOS 

,IVTEATAKAKYPFSYFALTTGEIVEGSPFFNGSNGKHFAEPLEK- -LTIL 
IVEEVDARSVYPYDEFVUTGDFVYMSPFYGYREGSH7EHTSYAA0RFKQ 



KSHV VQV VLNHTWTYSDRGTSPTPQNR I FVETGAYTLSWASESKTTAVCPLALWKTFPRS I QTTHED 

HVS VQIDlilHTVVDYGNRATSQQHKKRIFAHTLDYSVSWEAVNKSASVK 

bHV4 VWIEENHQTVDY-ERRGHPTXDKRIFLXDEEmSWK^ 

mHV68 RE VK - NYKRDYNNRGT/^A(^RTFL£TPSATYSWmTRQTATCDLVHW^PRAIQTAHEH 

EBV FHVRTNYKIVDYDNRGTNPQGERRAFLDKGTYTC 

hHV6 ENYTHIEDLMNG-MNGAmVRKIAREKADTU^VO 

hHSVl - VOGFYARDLTTKARATAPTTRM - - LLTTPKFTVAWDWVPKRPSVlTMTKWQEVDEHLRSEGGG 



KSHV SFHFVANEITATFTAPLTP- - - VANFTDTYSCLTSDimTlilA-SKAKLASTHVPNGTVQYFHT 

HVS TYHFIANEITAGFSTVKEP - - -lANFTSDYIKLHTHIKrTlfD-KIARVNNTKrPNGTAEYYqT 

bHV4 SFHFVANEVTASFL7SNQEETEUIGHTEIUCHNSTINETLEE -TVKKFNKSHIRDGEVKYYKT 

rtW68 SYHFVANEVTATFNTPLTE - - -VENFTSTYSCVSDQINKnSE-YIWUJNSYVASGKTQYFKT 

EBV SIHVFTDEGTSSFV7NTTV - - -GIELPDAFKfilEEQVNlCfflHEKYEAVQORYTKGQEAITYFIT 

hHV6 TYHFISKELTAAFVAPKES- - -LNL7DPKQTCIKDEFEKIINEVYMSDYNDTYSMNGSYQIFKT 

hHSVl SFRFSSDAISTTFTTNLTE - • YPLSRVDLGD£IGKDARDAMDRIFARRYNATHIKVGQPQYYLA 



KSHV TGGLYLVWQPMSAI NLTHAQGDSGNPTSSPPPSASPHTTSASRRK 

HVS EGGMILVWJPLIAIELEEAMLEATTSPVTPSAPTSSSRSKRAI 

bHV4 NGGLFLIWQAHKPLNLSE HTNYTIERNNKTGNKSRQK 

0HV68 DGNLYLIWQPLEHPEIEDIDEDSDPEP-TPAPPKSTRRKREAA 

EBV SGGLLLAWLPLTPRSLATVKNLTELTTPTSSPPSSPSPPAPSAARGSTPAAVU^RRRRDAGNAT 

hHV6 TGDLILIUQPLVQKSL- - HFLEQGSEKIRRRROVVDVKSRHDI 

hHSVl NGGFLIAYQPLLSNTIAELYVREHLREQSRKPPNPTPPPPGAS 



FAYDA> 

KSHV PJ^ASTAAAGGGGSTDNLSYTQLQFAYDKU^INQVl£ELSRAWCREQVRDNLMWYELSKINP 



HVS 
bHV4 



- - - -RSIRDVSAGSENNVFLSQIQYAYDKLRQSINNVLEELAITVCREQVRQTHVWYEIAKINP 



- - - -RSVDTKTFQGAKGLSTAQVQYAYDHLRTSHNHILf EL7 

BHV68 -DNGNSTSEVSKGSENPLITAQIQFAYDKLTTSVNNVLEEL' 

EBV TPVPPTAPGKSLGTLNNPATVQIQFAYDSLRRQINRMLGDL 

hHV6 - -LYVQLQYLYDTLKDYINDALGNLAES 

hHSVl ANASVERIKTTSSIEFARlJQFTYNHIQRHVNDHLGRVAI/> 

* * * * 



EQKKDNLMWYELSKINP 
, EQVRDTLMWYELSKVNP 
CLEQKRQNMVLRELTKINP 
mLDQKRtITNLHELSKISP 

:elqnheltlwnearxlnp 

* * * * 
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Figure 3(c) 



IYGKA> CYSRA> 
KSHV TSVMTAIYGRPVSAKFVGDAISVTECIMVDQSSVNIHKSLRTNSK DVCYARPLVTFKF 

HVS TSVHTAI YGKPVSRKALGDVISVTECINVDQSSVS IHKSLXTENN DICYSRPPVTFKF 

bHV4 VSVMMIYGKPVAVKAMGDAFMVSECINVDQASVNIHKSMR7DDP KVCYSRPLVTFKF 

D1HV68 TSVMSAIYGICPVAARYVGDAISVTDCI YVDQSSVN IHQSLRLQHD KTTCYSRPRVTFKF 

EBV TTVI«SIYGKAVAAKIU.GDVISVSIKVPVIWTVTLRKSMRVPGS ETMCYSRPLVSFSF 

hHV6 SSIVSEVYGRPISAQLHGDVUISI^IEVWSSVQIJKSMRVVDAKGVRSETWCYNRPLVTFSF 

hHSVl NAIASATVGRRVSARMLGDVHAVSTCVPVAADNVI VCWSWISSR PGACYSRPLVSFRY 

* ** * ****** 



KSHV LNSS - NLFTGQLGARNEI I LTNNQVETCKDT£EHYFITRNETLVYKDYAYLRTIWrroiSTLNT 



HVS VNSS-QLFKGQLGARNEILLSESLVEN 

bHV4 VNST-ATFRQQLGTRNEILLTMTHV - 

DHV68 INST-DPLTGQLGPRKEIILSNTNIE 

EBV INDT-KTYESQLSTDNEIFLT 

hHV6 VNSTPEVVPGQLGLDNEILLGDHRTEE_ 

hHSVl EOQG - PLVEGQVGENNELRLTRDAIEP£ 
** * * * ■ 



rAKNETYHFKNYVHVETLPVNNISTLDT 

?PTADHYFFVKNMTNYFKDYKFVKTHDTNNISTLDT 

1ESEHYFIVGEYIYYYKNYIFEEKLNLSSIATLDT 

, \TSQYYFQSGNEIHVYNDYHHFKTIELDGIATLQT 

[IPSTCIFLSGNHAHVYTDYTHTNSTPIEDIEVLDA 

1GHRRYFTFGGGYVYFEEYAYSHQLSRADITTVST 
* * 



<FREYB 

<NIDFB <NVFDB FREYA> 

KSHV FI/UJ^FIWIDFKAIELYSSAEKRLASSVFDLETMTO 

HVS FI^NLTFIENIDFKAVELYSSGERKI^-NVFDLETMREYNYYAQSISGLRKDFDNSQRNNRO 
bHV4 FLTLNLTFIDNIDFIO\ELYSETERKMA-SAIJ)LET>)REYNYYTQKLASLfiEDLDNTIDLNRD 
nHV68 FIAUIISFIENIDFICTVELYSSTERKLASSVFDIESMFREYNYYTYSLAGIKKDLDMTIDYNRD 
EBV FISLNTSLIENIDFASLELYSRDEQRAS-NVFOLEGIFREYNFQAQNIAGLRKDLDNAVSNGRN 
hHV6 FIRLKIDPLENADFKVLOLYSPOELSRA-NVFDLENILREYNSYKSALYTIEAKI • - - - AANTP 

hHSVl FIDLNI7ML£OHEFVPLEVY7RHEIIQSGLU)YTEVQRRNQLilDLRFADIDTVIHADAMAAHV- 
* * * * * 



RIIQDFSEILADLGSISKVIVNVAS6AFSLFGGIVTGILN FIKNP £6HFTFLLI6A 



KSHV RFVRDI^EIVADLGGIQCTVVNVASSVVTLCGSLVTGFIN. 

HVS 
bHV4 
mHV68 
EBV 
hHV6 
hHSVl 



FIKHP J3GHUUIIVIA 



RLVKDLSEMMADLGDIGKVVVMTFS6IVTVFGSIVGGFVS FFTNP GGVTHUUV 

RLVQOLSOMMADLGDIGRSWNVVSSWTFFSSIVTGFIK FFTNP GGIFILLIIQG 

QFVOGLGELMDSLGSVGQSITNLVSTVGGLFSSLVSGFIS FFKNP -GGHLILVLVAG 

SYVNGINSFLQGLGAIGTGLGSVISVTAGALGDIVGGWS FLKNP r GGGLHLILAIV 

IHADANAAMFAGLGAFFEGHGDLGRAVGKV\^IVGGVVSAVSGVSSFMSNPFGALAVGLLVLA 



Membrane-spanning dona 



KSHV 

HVS 

bHV4 

mHV68 

EBV 

hHV6 

hHSVl 



IIUIFNL SRRTNTIAQAPVKMI YPDVDRRAPPSGGAP - 
VIILVIL1 



VWII1 .._ 
GLAAAFFAfR' 



TREEIKNILLGHHQLQQE 



VRRTNNHSQAPIRMIYPDVEKSKSTVTP HEPETIKQI LLGHHNHQQE 

WFVVFIV SRRTNWiNEAPIKHIYPNIDKASEQENIQP LPGEEIKRILLGHHQLQQS 

.^soniiw 

WILVISIjrRRTRQMSCnPV^ 

flRHVI^IDHHFPYATNPVTmSVTGTTVVK^^ 

MRLQSNPWCALYPLTTXELKNPTNPOASGEGEEGGDFDEAKLAEAREMIRYM 
* * 



IVV "VRORHV 
FFAFRYVMRL 
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Figure 3fd) 



KSHV ERQKADDLKKS7PSVFQRTANG - LRQRLBGYKPLTQSLDISPETGE 

HVS AYKKKEEQRAARPSIFRQAAETFLR-KRSGYKQISTEDKIV 

bHV4 EHGKSEEEASHKPGLFQLLGDGLQLLRRRGYTR- LPTFDPSPGNDTSE7TKJKYV 

nHV68 EKEAQKQLTNSQ'TLWQK-ATGFLRMyyCGYSQ- LPLEDESTSL 

EBV QKRAAQRAAGPSVASRALQAARDRFPGLRRRRtHDPETAAALLGEAETEF 

hHV6 ADVSGQVSDDEYSQEMIJ^LKAIICSLDESYRR-KPSSSESHASKPSLIDRIRYRGYKSVNVEEA 

hHSVl ALVSAHERTCHKAKKKG7SALLSAK\ni)MVMRKRRIfrNYTQVPNKDGDADEDDL 
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Figure 4 



EBV GGCGACCTGTTCCGCTTCTCCTCGGACATCCAGTGTCCC 

SHV1 GGAGAATTGTTTAGATTTGATTTAGACAGAACTTGTCCA 

mHV68 GGGGAGACCTTCAGGTTTGATTTAGACAAAACATGCCCC 

bHV4 GGAGAACTATTCAGATTTGACCTTGATCAGACATGTCCA 



5 • -gctgttcagatttgacttagaymanmcntgycc-3 • 
FRFDA 256-fold 33mer> 



Figure 5 



EBV GAGGGCCTGTTGATGGTGTTTAAAGACAACATTATTCCCTACTCGITTAAG 

sHVl GAAGGCATTCTTTTAGTGTACAAAAAAAATATAGTTCCATATATCTTTAAA 

lriHV6 6 GAGJGGCATCTTGCTCGTGTATAAGATCAACATCGTGCCCTACATCTTCAAA 

bHV4 GAAGGCATCCTGCTGGTACTAAAAAAGAATATTGTCCCATACATCTTCAAA 



5 * -gtgtacaagaagaacatcgtgccntayatnttyaa-3 ' 
NIVPA 64- fold 32mer> 

5 ' -gtgtacaagaagaacatcgtgcc-3 1 
NIVPASQ 23mer> 



Figure 6 



EBV ATATGG ACTT ACAG AACAAG AACTACCGTCAACTGC CTC ATAACT C CAAG 

SHV1 TGGGGAACTTACAGGACTCGAACTACCXyiTAACTGTGAAATTGTAGAC^ 

mHV6 8 CCTGGCTTTTATCGTGTTCGAAC (^CCGTTAACTGTGAAATTGT AGACATGG TGGCACG C 

bHV4 CCAGGGATTTATAGAGTGAGAACAACTGTTAATTC 

5 ' -tggacttacaggactcgaacnacngtnaaytg-3 ' 
TVNCA 128- fold 32mer> 

3 ■ -tgntgncanttracactctaacatctgtacaa-5 ' 
<TVNCB 128 -fold 32mer 

3 • -acactctaacatctgtacaa-5 ' 
<TVNCBSQ 20mer 
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Figure 7 



EBV AACAATCCCGCC71CCGTCCAGATCCAATTTGCCTACGAC 

SHV1 AATAATGTGTTTCTATCACAAATACAATATGCATATGAT 

mHV6 8 AATCCGCTCATTACGGCCCAAATTCAATTTGCCTATGAC 

bHV4 AAGGGCCTGTCCACTGCCCAGGTTCAATATGCCTATGAC 

5 1 -aataacctctttacggcccaaattcartwygcntayga-3 ■ 
FAYDA 64 -fold 38mer> 



Figure 8 

AATCCAACCACCGTCATGTCCAGCATCTACGGTAAGGCGGTG 
AATCCAACAAGTGTTATGACAGCAATATATGGAAAACCTGTC 
AACCCTACGAGTGTGATGTCTGCCATTTATGGAAAGCCTGTC 
AACCCAGTGAGTGTCATGGCAGCCATTTATGGGAAACCTGTG 

5 • -ccaacgagtgtgatgtcagccatttayggnaarccngt-3 ' 
IYGKA 64 -fold 38raer> 

5 ' -ccaacgagtgtgatgtcagcc-3 1 
IYGKASQ 21mer> 



Figure 9 



EBV TGCTACTCG CGCCCCCTGGTGTCCTTCAG CTTT ATCAACGAC 

sHVl TGCTATTCACGGCCTCCAGTTACATTTAAATTTGTTAACAGT 

mHV68 TGCTACTCGAGACCTAGAGTCACCTTCAAATTTATAAACAGT 

bHV4 TGTTACTCCAGACCCCTGGTCACATTTAAATTTGTGAATAGT 

5 ' -tgctactcgcgacctctagtcaccttyaarttyrtnaa-3 1 
CYSRA 64 -fold 38mer> 



5 ' -tgctactcgcgacctctagtcacc-3 ' 
CYSRASQ 24mer> 
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Figure 1Q 



EBV 
SHV1 
mHV68 
bHV4 



' AACATTGACTTTGCCTCCCTGGAGCTGTACTCACGGGACGAACAGCGT 
AATATTGACTTTAAAGCTGTTGAATTGTA^ 

AATATCGACTTCAAAACAGTAGAACTGTACTCCTCTACTGAAAGGAAA 
AATATAGATTT(^GACAGTGGAACTTTACAGTGAGACTGAAAGAAAG 

3 1 -ttrtadctraarttytgtcaccttgacacgaggcca-5 » 
<NIDFB 48 -fold 36mer 



Figure 11 



3 1 -tgtcaccttgacatgaggcca-5 • 
<NIDFBSQ 21mer 



EBV 
sHVl 
mHV68 
bHV4 



AACGTCTTTGACCTGGAGGGCATCTTC 

AACGTGTTTGATTTAGAGACTATGTTTAGAGAATATAACTA7TACGCTCAGAGT 
AGCXSTCnTTGATATAGAATCCATGTTTAGGGAATATAACTATTACACCTACAG^ 
AGTGCCCTCGACCTGGAGACGATGTTTAGAGAGTATAATTACTACACACAGAAG 

5 ' -tttgacctggagactatgttymgngartayaa-3 • 
FREYA 64 -fold 32mer> 



Figure 12 



3 ' -tacaartcyctyatrttgatgatgtgggtctcg-5 
<FREYB 16 -fold 33mer 



3 ' -ttrcanaarctrgacctctcgtacaaggctct-5 ' 
<NVFDB 32- fold 32mer 



EBV 
SHVl 
mHV68 
bHV4 



TTCATCTC CTTCTTCAAAAAC CCCTTCGG CGGCATGCTC 
ATATTAAATTTTATTAAAAATCCTTTAGGTGGCATGTTC 
TTCATTAAATTCTTTACCAACCCTCTAGGGGGAATATTC 
TTTGTCAGTTTTTTCACAAACCCCATTGGGGGCGTGACG 

5 ' -accttcatcaaaaatcccttnggnggnatgyt-3 1 
GGMA 128-fold 32mer> 



10/26 



WO 97/12042 



PCT/US96/15702 



Figure 13(a) 



VRRYRK IATSVTVYRGLTE 

KSHV GGTGCGGCGCTATAGGAAAATTGCCACCTCTGTCACGGTCT71CAGGGGCTTGA - 

RFHV . . .A. .CA.G. .C.TA A. .A. .A C.C. .TA A-- 

bHV4 A. . .A. .AAA A A. .G. - A. .T A. .G.G.T. CC. . -- 

SHV1 A. .CA.AA.A. .C.AA CA.A. .A. .A. . . CGTA . T. TT . AT G. . .TAGAGA 

eHV2 T. .CA.AAAG G. .CATG GAC. . .CA A. . . T.G . .GC. . . — 

mHV68 AA. CA. .A. A A. .A. . AATT. .TCAAC.G. -CA. . . GGC . A . . .CA. . CACT-- 

hEBV . . .C. .CTC. . . C.CC. .G. .A.TG. ; .AACA.TCTCA AT G.TAC.C-- 



SAITN KYELPRPVPLY E IS 

KSHV TCCGCCATCACCAACAAGTATGAACTCCCGAGACCCGTGCCACTCTATGAGATAAG 

RFHV G. A. .A. . . . .A A GA C. -G T C C. . 

bHV4 G. A. .TG.T T.G.G. . . .TA. . AGC . . . G. .A. A. . CTAT A . . . .A. .TTC 

SHV1 AGGTGTT . .T. .T. .A A.GG TT . T .* . .G.T. .T. . .AAA. .... GA 

eHV2 GAT A. .A. . .C. . C . CACGAGGAGCTACG C. .C. .G. .C. . -G.CCA 

mHV68 AGTT.AG TGGT. .A.T A.GG.C.CT.AG.CC . ACGAG.GG. . AG. GG. 

hEBV GA.T. .G.G CG.C.C. . GGAGAA . TTCT TGACAG. . .C. .A.CTGA 



HMDST YQC FSSMKVNVNGVE 

KSHV CCACATGGACAGCACCTATCAGTGCTTTAGTTCCATGAAGGTAAATGTCAACGGGG 

RFHV T C AA.TGTA. .G. .C. .A. .C. . 

bHV4 AATG. .A. .T. .G T. .T. .CTC. G.T. . . GCAAC . GTCA . T . .T. . . A.TCT 

SHV1 TATT T. AG. .T. -C. .A. .TCAT.A. .G. . . .C. .A. .G.A. .A AA.GTT 

eHV2 GATG CA.TAT CG. .G.AC C. .C.AGGGG. . .CAC.T 

mHV68 .G. .T.T T . AC. A. AG . GCC. CCA. GGTG . .A. . .AAC. .CAG 

hEBV . . .G T.C. .T. . .C. . ... . . .AC.ACG.GG.C. . . A.G. CAAAAG.T. . .C.GAC 
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Figure 13(b) 



NTFTDRDDVNTTVFLQPVEG 

KSHV AAACACATTTACTGACAGAGACGATGTTAACACCACAGTATTCCTCC^ 

RFHV . . .T. .G. .C. -C. .TC.G. .T. .C. .A AA. .C .T G. .C. .C. -A. . 

bHV4 G C.A. .TA G. .TTC. .AA. .T.AGT.T. .TCC G G.CC. . 

SHV1 . . .TT. T.ACTA T.GAAA AA. .T. . .GA.T.AA.G. .T T. . 

eHV2 C C.ACTA G GGTGG. . .GAG. .C.CC A G.CC. .T. . 

mHV68 .C.GGTG.A.GTG. T.GG. .C. .T. AA. .T. .GAA.A.A.GC. .T. .T. .T. . 

hEBV GCG.GTG.A.GTA. . .C.C. . . .GA T. . .C. .CAA. . . AA.G . .CACC.G. . . 



LTDNI QRYFSQPV IYAEPGW 

KSHV GCTTACGGATAACATTCAAAGGTACTTTAGCCAGCCGGTCATCTACGCGGAACCCTC 

RFHV T. .A. .T. .C A A A. .A. .AC.G. . .T.T A. . 

bHV4 A. .G. .T. .G TAA.C. .A T. .T. .A. .TC A. .T. .A T 

SHV1 T. .A G.GCA.T.AC. . .A. . .A.T A. .TAAAG.T.TT. .T. ,T. ,T 

eHV2 T. .C. .CTC. .G. .TAACGC.C. .TCAG. .T. .A. .A.AGG.G CACC. . .A. AAA 

mHV68 T. .A. . A.GG. .T. .C A T CAC.C.T. .TT. A T. .T. . 

hEBV C. .GG.CA.CGGGG.G.GCC.C. . .GCC A. . . AGC . . , .T.AC.CC G. . 

FPGIYRVR 

KSHV GTTTCCCGGCATATACAGAGTTAGG 

RFHV . . .C. .A. ,T. .C G...G.. 

bHV4 A. .G. . T. .T G. .A 

SHV1 . C . ATGG . . A . CT ..... GAC .C.A 

eHV2 CC.GTTGT.GTCT. . . .C.ACA. .A 

mHV68 . A. G . .T. . .T.T. .TC.T. . .C.A 

hEBV . . . GATAT . G . CT ACA. .A 



12/26 



WO 97/12042 



PCT/US96/15702 



Figure 14 



KSHV VRRYRKIATSVTVYRGLTES - -AITOKYELPRPVPLYEISHMDSTYQCFSSMKVNVNGVE 

RFHV I. . . M. .A-- I IV 

bHV4 . .K WSQA--.V. . RDDIS .AI . YN. . .MI.R. .H. . .A.ATVI. . IL 

SHV1 K. .T. . .RIFN.W.REGV W. .S.A. .K. . .DIM.K. . . . HNC.QIE. . .ML 

mHV68 I I.QL.IW T . - -SV.G. F.MATQAHEW. VGDF. .1. . .YN.ATOV. .N.R 

eHV2 . .K M. . T . I . K . WS . D- - QHTRSYA VQM. .HY AVQ. . EG.HV 

hEBV . .S.T. . V.NILI .N. WYAD- -SV. .RH.EKFS.DS. .TDQ. .TI. . . YNAV . MTKD . LT 

VTVYRG AITNKYE SHMPSTY VE- 

rgltesa 
rgmteaa 

KSHV NTFTDRDDVNTTVFLQPVEGLTDNI QRYFSQPV I YAEPGWFPGI YRVR 

RFHV K L.S G 

bHV4 . .YI. . .SE.KS.P A. . . E . .N L 

SHV1 ,SYY. . .GN.K. .D.K. .D. . .GA.T. .1. . .KVF.D. . . LW . T . . T . 

mHV68 QVYV. . .G. .K. .NIR. .D. . .G TL .S....M..F.... 

eHV2 . .YY. . .GW.E.A. . K. AD . . .SS.T. .Q. . .EV. . T . RNLLWS . TT . 
hEBV RVYV. . .G. .1. .N.K.TG. . ANGVR. .A. .TEL. DA. . .LIWT. .T. 

-NTFTD TVFLQPV TDNIQRY pviyaep 

pvlysep 
RYFSQP 

(-) Deletions 

{.) Amino acids identical to KSHV 
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Figure 15 



KSHV VRRYRKIATSVTVYRGLTES — AITNKYELPRPVPLYEISHMDSTYQCFSSMKVN 

RFHV VRRYIKIATSVTVYRGMTEA — AITNKYEIPRPVPLYEISHMDSTYQCFSSMKIV 

bHV4 VRKYRKIATSVTVYRGWSQA — AVTNRDDISRAIPYNEISMIDRTYHCFSAMATV 

sHVl VRRYKKITTSVRIFNGWTREGVAITNKWELSRAVPKYEIDIMDKTYQCHNCMQIE 

eHV2 VRKYRKIMTSTTIYKGWSED — AITNQHTRSYAVPLYEVQMMDHYYQCFSAVQVN 

mHV68 IRRYRKIITQLTIWRGLTTS — SVTGKFEMATQAHEWEVGDFDSIYQCYNSATMV 

hEBV VRSYTKIVTNILIYNGWYAD — SVTNRHEEKFSVDSYETDQMDTIYQCYNAVKMT 

hCMV VRVYQKVLTFRRSYAYIHTT — YLLGSNTEYVAPPMWEIHHINSHSQCYSSYSRV 

hHHV6 VRTYKKELTFQSSYRDVGW — YFLDRTVMGLAMPVYEANLVNSHAQCYSAVAMK 

hVZV ATVYYKDVIVSTAWAGSSYT-QITNRYADRVPIPVSEITDTIDKFGKCSSKATYV 

SHVSA8 ATMYYKDVTVSQVWFGHRYS-QFMGI FEDRAPVPFEEVMDKINAKGVCRSTAKYV 

hHSVl ATMYYKDVTVSQVWFGHRYS-QFMGIFEDRAPVPFEEVIDKINAKGVCRSTAKYV 

KSHV V-NGVENTFTDRDDV-NTTVFLQPVEGLTDNIQRYFSQPVIYAEPGWFPGIYRVR 

RFHV V-NGVENTFTDRDDV-NKTVFLQPVEGLTDNIQRYFSQPVLYSEPGWFPGIYRVG 

bHV4 I-NGILNTYIDRDSE-NKSVPLQPVAGLTENINRYFSQPLIYAEPGWFPGIYRVR 

sHVl V-NGMLNSYYDRDGN-NKTVDLKPVDGLTGAITRYISQPKVFADPGWLWGTYRTR 

eHV2 E-GGHVNTYYDRDGW-NETAFLKPADGLTSSITRYQSQPEVYATPRNLLWSYTTR 

mHV68 V-NNVRQVYVDRDGV-NKTVNIRPVDGLTGNIQRYFSQPTLYSEPGWMPGFYRVR 

hEBV K-DGLTRVYVDRDGV-NITVNLKPTGGLANGVRRYASQTELYDAPGWLIWTYRTR 

hCMV I-AGTVFVAYHRDSYENKTMQLMPDDYSNTHSTRYVTVKDQWHSRGSTW-LYRET 

hHHV6 RPDGTVFSAFHEDNNKNNTLNLFPLNFKSITNKRFITTKEPYFARGPLW-LYSTS 

hVZV R-NNHKVEAFNEDKN-PQDMPLIASKYNSVGSKAWHTTNDTYMVAG-TPGTYRTG 

SHVSA8 R-NNMESTAFHRDDH-ESDMALKPAKAATRTSRGWHTTDLKYNPARVEAFHRYGT 

SHSV1 R-NNLETTAFHRDDH-ETDMELKPANAATRTSRGWHTTDLKYNPSRVEAFHRYGT 
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Figure 16 



Gamma 



Beta 



Alpha 



KSHV 

RFHV 

bHV4 

SHV1 

eHV2 

mHV68 

hEBV 

hCMV 

hHHV6 

hVZV 

SHVSA8 

hHSVl 
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Figure 17(a) 



KSHV GGTGCGGCGCTATAGGAAAATTGCCACCTCTC - - 

RFHV . . .A. .CA.G. .C.TA A. .A. .A C.C. .TA A-- 

bHV4 A. . .A. .AAA A A. .6. .A. .T A . . G . G . T . CC . . - - 

SHV1 A. .CA.AA. A. .C.AA CA.A. .A. .A. . .CGTA. T.TT . AT G. . . TAGAGA 

eHV2 T. .CA.AAAG G. .CATG GAC. . .CA A. . .T.G. .GC. . . - - 

mHV68 AA. CA. .A. A A. .A. . AATT. . TCAAC . G . .CA. . .GGC.A. . .CA. . CACT- - 

hEBV . . .C .CTC. . .C.CC. .G. . A.TG . . . AACA . TCTCA AT. . . .G.TACC -- 



KSHV TCCGCC^TCACC^CAAGTATGAACrrCCCGAGACCCGTGCCACTCTATGAGATAAG 

RFHV -■ G.A. .A. ... . .A A GA. ...C..G T C C. 

bHV4 G.A. .TG-T T.G.G TA. . AGC. . .G. .A. A. . CTATA A. . TTC 

sHVl AGGTGTT. ,T. .T. .A A.GG TT.T. . .G.T. .T. . .AAA. . . . GA 

eHV2 GAT A. .A. . .C. . C . CACGAGGAGCTACG C. .C. .G. .C. . .G.CCA 

mHV68 AGTT.AG. . . . TGGT. . A.T A.GG.C.CT.AG.CC.ACGAG.GG. .AG.GG. 

hEBV GA.T. . G . G CG.C.C. . GGAGAA . TTCT TGACAG. . .C. .A.CTGA 

(SHMDA>) agacccgtgccactctatgarathag- 



KSHV CCACATGGACAGCACCTATCAGTGCTTTAGTTCC^ 

RFHV T C AA.TGTA. . G. . C. .A .C. . 

bHV4 AATG. .A. ,T . - G T. .T. .CTC. G.T. . . GCAAC . GTCA . T . .T. . .A.TCT 

SHVl TATT T.AG. .T. .C. .A. .TCAT.A. .G C. .A. .G.A. .A AA.GTT 

eHV2 GATG CA - TAT CG . .G.AC C. .C.AGGGG. . .CACT 

mHV6B .G. .T.T T AC . A. AG . GCC . CCA. GGTG . .A. . .AAC. . CAG 

hEBV . . .G T.C. .T. . .C AC.ACG.GG.C. . .A. G.CAAAAG.T. . .C.GAC 

(ENTFA>) gtcaacggagtaga- 
( <CFSSB ) acraartcragntactt ctaacaacacttg 
ycayatgga (SHMDA>) 
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Figure 17(b) 



KSHV AAACACATTTACTGAC^GAGACGATGTTAACACCACAGTATTCCT 

RFHV ...T. .G..C..C..TC.G..T..C..A AA. .C T G. .C. .C. .A. . 

bHV4 G C.A. .TA G. .TTC. .AA. .T.AGT.T. .TCC G G.CC. . 

SHV1 . . .TT. T.ACTA T.GAAA AA. .T. . . GA.T.AA.G. .T T. . 

eHV2 C C.ACTA G GGTGG. . .GAG. .C.CC A G.CC. .T. . 

mHV68 . C.GGTG.A.GTG T.GG . .C . . T . AA. .T. .GAA.A.A.GC. .T. .T. .T. . 

hEBV GCG.GTG.A.GTA. . .C.C. . . . GA T. . .C. . CAA. . . AA.G. .CACC.G. . . 

-raayacnttyacnga (ENTFA>) 



KSHV GCTTACGGATAAC71TTCAAAGGTACTTTAGCCAGCCGGTCATCTACGCGGAACCCGGCTG 



RFHV 

bHV4 

SHV1 

eHV2 

mHV68 

hEBV 



T. 
A. 
T. 
T. 
T. 
C. 



.A. .T. .C A A A. .A. .AC.G. . . T. T A. . 

.G. .T. ,G. . . .TAA.C. .A T. .T. .A. .TC A. .T. .A T 

.A G.GCA.T.AC. . .A. . . A.T A. .TAAAG.T.TT. .T. .T. .T 

.C. .CTC. .G. .TAACGC.C. .TCAG. .T. .A. .A.AGG.G CACC. . .A. AAA 

.A. .A.GG. .T. .C A T CAC.C.T. .TT.A T. . T. . 

.GG.CA.CGGGG.G.GCC.C. . .GCC A. . . AGC . . . .T.AC.CC G. . 



tgnctrttrtaagtttccatgaaatcggtcggtca (<DNIQB) 



KSHV GTTTCCCGGCATATACAGAGTTAGG 

RFHV . . .C. .A. .T. .C G. . .G. . 

bHV4 A. .G. .T. .T G. .A 

sHVl .C.ATGG. .A.CT GAC.C.A 

eHV2 CC.GTTGT.GTCT. . . .C.ACA. .A 

mHV68 . A.G . .T. . .T.T. .TC.T. . .C.A 

hEBV . . .GATAT.G.CT. ACA. .A 
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Figure 19fa) 



Capsid/Maturation/Transport Gene 

GGMFPIQKMMVSEMIWPSIE 
TGGGGGCATGTTTOXATTCAAAAGATCATGGTATl^^ 60 

RKDWIEPNFNQFYSFENQDI 
GCQGAAGGACTGGATASAGCCWACTTCAACCAGTTCTATAGCTnG 120 

NHLQKRAWEY IRELVLSVSL 
AAACCATCTGCAAAAGAGAGCTTBGGAATATATCA&AGAGCTGGT ATTATCGG 1 1 ICICI 180 

NNRTWERELKILLTPQGSPG 
GAACAACAGMCTTGGGAGAGGGAGCTAAAMTACrTCTCACGCCrcA 240 

FEE PKPAGLTTGLYLTFEIS 
GTTTGAGGAACCGAAACCCGCAGGACTWCAAQ^^ 300 

APLVLVDKKYGWIFKDLYAL 
TraramSGTCTTT^ 360 

LYHHLQLSNHNDSQV* Glycoprotein B Gene 

MTPRSRLATL 
TORTArrj^CACCTffiAACTGA GCAACCACMTGACmCA^ 420 

GTV I LLVCFCAGAAHSRGDT 
TGGGGACTGTCATCCTCnGGTCTGCTTTTGCGCAGGCGCGGCGCACTCGAGGGGTGACA 480 

FQTSSSPTPPGSSSKAPTKP 
CCTTTCAGACGTCCAGTTCCCCCACACCCCCAGGATCTTCCTCTAAGGCCCCCAC^ 540 

GEEASGPKSVDFYQFRVCSA 
CTCGTGAQGAAGCATCTGGTCCTMGAGTGT^ 600 

SITGELFRFNLEQTCPDTKD 
CATCGATCACCGGGCiAGCTTTrrCGGTrWACCTGGAGCAGACGTK^ 660 

KYHQEGI LLVYKKNIVPHIF 
AD^AGTACCACCAAGAAGGAATTTTACTGGTCTACAAAAAAAACATAtrrGC 720 

KVRRYRK IATSVTVYRGLTE 
TTAAGGTGKGCGCTATAGGAAAATTGCCAttrrCTGTCACGGTQ 780 

SA1TMKYELPRPVPLYEISH 
AGTCCGCCATCACCMCAAGTATGAACTCCCGAGACCCGTGCCACTCTATGAGATAAGCC 840 

MDSTYQCFSSMKVNVNGVEN 
ACATGGACAGCACCTATO^GTCCTTTAGTTCCATGAAGGTAAATGTCAAC 900 
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Figure 19(b) 



TFTDRDDVNTTVFLQPVEGL 
ACACATTTACT(^AGA)^G^TGTTMCACCACAGTATTCCTCCAACCAGTACAGGGGC 960 

TDMIQRYFSQPVIYAEPGWF 
TTACGGATMWTTCAAAGGTACTnAGCCAGCCGGTW^ 1020 

PGIYRVRTTVNCE IVDMIAR 
TTCCCGGCATATACAGAGTTAGGACCACTGTCMTTGCGAGATAGTQGACATGATAGCCA 1080 

> C 

SAEPYNYFVTSLGDTVEVSP 
GGTrCTGCTGAACCATACAATTACTTTGrrCACGTCACT 1140 

FCYNESSCSTTPSNKNGLSV 
CTTTTTGCTATAACGAATCaCATCCAGCACAACCCCCAG^ 1200 

QVVLNHTVVTYSDRGTSPTP 
TCCAAGTAGTTCT(^^C^OCTGTGGTCACGTACTCT6ACAGAGGAACCAGTCCCACTC 1260 

QNRIFVETGAYTLSWASESK 
CCCAAMCAGGATCITTGTGGAAACGGGAGCGTAWCGCTnCGTGGGCCTCCGAGAGW 1320 

TTAVCPLALWKTFPRS IQTT 
AGACCACGGCCGTGTGTCCGCTGGCACTGTBGAAMCCTTCCCGCGCTCCATCCAGACTA 1380 

HEOSFHFVANEITATFTAPL 
CCCACGAGGACAGCTTCCACrTTGTGGCCMCGAGATCACGGCCACCTra 1440 

TPVANFTDTYSCLTSDINTT 
TMCGCCAGTGGCCAACTTTACCGACACGTACTCnGTCTGACCTCGGATAT 1500 

LNASKAKLASTHV PNGTVQY 
CGCTTAACGCCAGCAAGGCCAAACTGGCGAGCACTCACGTCCCTAACGGGACGGTCCAGT 1560 

FHTTGGLYLVWQPMSAINLT 
ACTTCCACACAACAGGCGGACTCTA 1 1 1 GG I CT GGCAGCCCATGTCCGCGATTAACCTGA 1620 

HAQGDSGNPTSSPPPSASPM 
CTWCGCTCAGGGCGACAGaaGGAACCCCACGTW 1680 

TTSASRRKRRSASTAAAGGG 
TGACCACCTCTGCCAGCCGCAGAAAGAGACGGTCAGCWGTACCGCTGCTGCCGG^ 1740 

GSTDNLSYTQLQFAYDKLRD 
GGGGGTCWCGGACAACCTCT{TTACACGCAGCTGCAGTTTGCCTAC^ 1800 

GINQVLEELSRAWCREQVRD 
ATGGCATTAATCAGGTGTTAGAAGAACTCTCCAGGGCATGGTGTCGCGAGCAGGTCAGGG 1860 
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Figure 19(g) 



NLMWYE LSKINPTSVMTAIY 
ACAACCTMTGTGGTACGAGCTCAGTAAMTCAACCCWCCAGCGTTATCACAGCCATCT 1920 

GRPVSAKFVGDAISVTECIN 
ACGGTCGACCTGTATCCGCCAAGiTTCGT AG6AGACGCCATTTCCGTGACCGAGTGCATTA 1980 

VDQSSVNIHKSLRTNSKDVC 
ACGTGGACCAGAGCTCCGTAAACATCCACAAGAKCTCA^ 2040 

YARPLVTFKFLNSSNLFTGQ 
SnACGCGCGCCCCCTGGTGACGTTTAAGTTTTTGAACAGTTCXA^ 2100 

LGARNEI ILTNNQVETCKOT 
AGCTCGGCGCGCGCMTGAGATMTACTGACCAACAACCAGGTGGAMCCTGCAAAGACA 2160 

CEHYFITRNETLVYKDYAYL 
CCTGCGAACACT ACTTCATCACCCGCAACGAGACTCTGGTGTrATAAGGACTACGCGTACC 2220 

RTINTTDISTLNTFIALNLS 
TGCGCACTATAMCACCACTGACATATCCACCCTGAACACTTTTATCGCCCTGAATCTAT 2280 

F1QNIDFKAIELYSSAEKRL 
CCTTT AnCAAMCATAGACTTCAAGGCCATCGAGCTGT^ACAGCAGTGCAGAGAAACGAC 2340 

ASSVFDLETMFREYNYYTHR 
TCGCGAGTAGCGTGTTTGACCTGGAGACWTGTTCAGGGAGTACMCTACTACACACATC 2400 

LAGLREDLDNT1DMNKERFV 
GTCTCGCGGGTTTGCGCGAGGATCTGGACAACACCATA6ATATGAACAAGGAGCGCTTCG 2460 

RDLSE I VADLGGIGKTVVNV 
TAAGGGACTTGTCG6AGATAGTGGCGGACCTGG6TGGCATCGGAAAAACGGTGGTGAACG 2520 

> T 

ASSVVTLCGSLVTGFINFIK 
TGGCCAGCAGCGTGGTCACTCTATGTGGCTCATTGGTTACCGGATTCATAMTTTTATTA 2580 

HPLGGMLMIIIVIAIILIIF 
AACACCCCCTAGGTGGCATGCTGATGATCATTATCGnATAGCAATCATCCTGATCATTT 2640 

MLSRRTNT1AQAPVKH I YPD 
nATGCT(^\GTCGCCGCACCMTACCATAGCCCAGGCGCCGGTGAAGATGATQ 2700 

VORRAPPSGGAPTREEIKNI 
ACGTAGATQjWGGGCACCTCCTAGCGGCGGAGCCCC^CACGGGAGGAAATCAAAAACA 2760 
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Figure 19(d) 



L'LGMHQLQQEERQKADDLKK 
TCCTGCTCG6AATGCACCAGCTACAACAAGAGGAGAGGCAGAAGGCGGATGATCTGAAAA 2820 

> T 

STPSVFQRTANGLRQRLRGY 
AAACTACACCCTCGGTUTTTCAGCCT 2880 

> L 

KPLTQSL0ISPET6E* 
ATAAACCTCTGACTCAATCGCTAGACATCAGTCCGGM^ 2940 

> T 

GAGGTTATTGTTTGATGTAAATTrAGGAAACACGGCCCGCCTCTGAAGCACCACATACAG 3000 

. DAM Polymerase Gene 
H D 

ACTGCAGnATCMCCCTACTCGrTTGCACACAGACACAAATTACCGTCCGCAGATCATGG 3060 

FFNPFI DPTRGGPRMTVRQP 
Al 1 1 1 1 1 CMTCCATTTATCGACCCAACTCGCGGAGGCCCGAGAAACACrGTGAGGCAAC 3120 

TPSQSPTVPSETRVCRLIPA 
CCACGCCGTCACAGTCGCCAACTCTCCCCTCGGAW 3180 

CFQTPGRPGVVAVDTTFPPT 
CCTGTTTCO^CCCCGGGGCGACCCGGCGTGGTTGCCGTGGACACCACATTTCCACCCA 3240 

YFQGPKRGEVFAGETGS IWK 
CCTACTTCCAGGGCCCCAAGCGGG6AGAAGTATTCGCGGGAGAGACTGGGTCTATCTGGA 3300 

TRRGQARNAPHSHLIFHVY D 
AAACAAGGCGCGGA(^GGCACG(^TGCTCCTATCTCGCACCTCATATTCCACGTATACG 3360 

IVETTYTADRCEDVPFSFQT 
ACATCGTGGAGACCACaACACGGCCGACCGCTGCGAGGACGTGCCAm 3420 

DI I PSGTVLKLLGRTLDGAS 
CTWATCAWCCCAGCGGCACCGTCCTCAAGCTGCTCGGOVGAACACT 3480 

VCVNVFRQRCYFYTLAPQGV 
6TGTCTGCCT6AACGTTTTCAGGCAGCGCTGCTAOT 3540 

NLTHVLQQALQAGFGRASCG 
TAAACCTGACCWCGTCCTCCAGCAGGCCCTCCACGCTGGCTTCGGTCGCGCATCCTGCG 3600 



F S T 

GCTTCTCCACCG 3612 
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20 40 60 80 



VRRYIKIATS VTVYRGMTEA AITNKYEIPR PVPLYEISHM DSTYQCFSSM KIWNGVENT 60 



FTDRDDVNKT VFLQPVEGLT DNIQRYFSQP VLYSEPGWFP GIYRVG 106 



23/26 

SUBSTTTUTE SHEET (RULE 26) 



WO 97/12042 



PCT/US96/15702 



Figure 21 




VRRYRKIATS VTVYRGLTES AITNKYELPR PVPLYEISHM DSTYQCFSSM KVNVNGVENT 60 



# 

FTDRDDVNTT VFLQPVEGLT DNIQRYFSQP VIYAEPGWFP GIYRVR 106 



24/26 



WO 97/12042 



PCT/US96/15702 




25/26 



WO 97/12042 PCT/US96/15702 

Figure 23 



VYKKNI VPNNFKVRRYRKVA 



TPVTLYRGMTDAAITNKYEI 
AprenCTrjOCTr rAcra^ 120 



PRPVPLYEISHMDSTYQCFS 
rrn^rrPCTArXJ>rrATArj^(^TCA(rrr^CATG^CAGCACCTACCAGTGCTTTAGT 180 



SHKIVVN6VENTFTGRD0VN 
TTrATEAAAATTCT AgTfiAACGfiACTCfiAAAACACGTTCACC 240 



KSV F LQPVEGLTDNI KRYFS 
AAAAfy(rrATTTCTrr.Afyr.ACTf^AAf^fTAAr.T^r^r.ATAMGA^TACTTTAGC 300 



QPVLYSEPGWFPGIYRVRTT 
rj^MCTOTATATTr^T^CC^ 360 



VNCEIVDH 

GTTAATTGTGAGATTGTAGACATGTT 380 



26/26 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

^I-REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: : 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



